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Preface 


This book is actually an expanded version of my first book Introduction to 
Special Theory of Relativity published by Allied Publishers in 1998. The 
reason I wanted to write another book on the same subject is that I pub- 
lished several articles and papers over the last two decades in the journal 
Physics Education (on-line since 2013) brought out by the Indian Associ- 
ation of Physics Teachers. These articles were intended to give students 
and teachers a better grip of some key concepts associated with Relativity, 
both Special and General. I have incorporated these articles in this book 
by merging them within its chapters and sections. The reader will find the 
original articles in the Bibliography. Their references are shown in the next 
paragraph in square brackets []. My treatments of these concepts are sup- 
posed to give additional strength to this book, and constitute a new feature 
of this book. 

I mention some of these features here: An elaborate introduction to 
tensors and stress tensors; Maxwell’s stress tensor and conservation of 
momentum in electromagnetic fields [17]; graphical construction of Lorentz 
transformation, of time dilation, length contraction and simultaneity para- 
dox [41]; magnetism as a relativistic effect [35]; Minkowski’s equation of 
motion illustrated by a mathematical treatment of relativistic rocket [27]; 
energy tensor for the electromagnetic field, and of a system of incoherent 
charged dust creating its own electromagnetic field [39]. 

Einstein’s two original papers that gave the world the Special The- 
ory of Relativity, and how the well-known phenomenon of electromagnetic 
induction led him to this theory, have been detailed in my article 1905 Rel- 
ativity Papers of Einstein [40]. How Minkowski’s further pursuit resulted 
in a four-dimensional geometrical world-view of events, and subsequently 
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of the dynamical variables of mechanics, and of electrodynamics, has been 
detailed in another of my article Minkowski’s Space-Time [41]. These two 
supplementary resource materials are not a part of this book, but they can 
be downloaded from my website, cited at the end of this Preface. 

This book is comprised of four parts divided into twelve chapters. Part 
I starts with a story of Special and General Relativity in brief, and is 
intended for a layman (Chapter 1). I have outlined the basic tenets of 
Special Relativity (Chapter 2), followed by Lorentz transformation (Chap- 
ter 3) and relativistic mechanics (Chapter 4). I have worked out many 
innovative problems and exercises. Some of them were designed to remove 
any doubt centred on relativity paradoxes, like simultaneity, time dila- 
tion, and length contraction. In fact my derivation of Lorentz transfor- 
mation is itself an exercise in this direction. The detailed worked-out prob- 
lems on Lorentz transformation and relativistic mechanics are intended to 
strengthen the reader’s understanding. It is hoped that an undergradu- 
ate student studying physics will not only understand and enjoy this part 
thoroughly but also derive all essential knowledge about Special Relativity 
from it. 

The rest of this book is intended for a reader seeking advanced knowl- 
edge, in particular the covariant language, in which advanced texts in Clas- 
sical Electrodynamics are written. This is where its wider reach is discov- 
ered and the journey to General Relativity begins. 

The entire Part II (Chapters 5 and 6) is an introduction to tensors, 
with special emphasis on the stress tensor. The reader will discover to his 
amazement and enlightenment that empty space loaded with an electro- 
magnetic field has stresses developing within it, just like a beam loaded 
with bricks, or any other ordinary matter, like solids and fluids, subjected 
to external forces. The special name for this stress is Mazwell’s stress ten- 
sor, and constitutes an important ladder to the energy tensor which I have 
explained in Part IV. 

Part III takes the reader into the four-dimensional world of Minkowski’s 
space-time. The introduction to tensors initiated in Part II, where it was 
restricted to three dimensions, now begins to give rich dividends in four 
dimensions, with terms like contravariant, covariant, raising and lowering of 
indices, and an introduction to the all-important metric tensor (Chapter 7). 

But what is the need for this esoteric trip to four dimensions? Because 
without it, it would be impossible to look at relativistic mechanics in 
a holistic manner. Without it how would one explain that energy and 
momentum together form one unit, the 4-momentum (for which I have 
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coined a new name En-Mentum, to remind the reader of its four com- 
ponents and their sequence), and transforms as one unit under Lorentz 
Transformation; or 4-force (Pow-Force)? I have shown Lorentz transfor- 
mation of a 4-vector, and specialized it to En-Mentum and Pow-Force and 
explained their significance and corollaries. Then I worked out Minkowski’s 
equation of motion (EoM), and conservation of energy and momentum as 
a single law of physics (Chapter 8). The Minkowski EoM is best. illus- 
trated by its application to relativistic rocket (Chapter 9), and the Lorentz 
transformation of 4-force by magnetism as a relativistic effect (Chapter 
10). The Principle of Covariance, for which the covariant equations of 
electrodynamics stand as a shining example, is the culmination of this 
trip (Chapter 11). 

Pat IV, titled Physics of a Relativistic Continua, has only one very 
important chapter, namely Chapter 12, dedicated to the energy tensor. 
We take a brief look at the non-relativistic EoM of a perfect fluid, known 
as Euler’s equation, and extend the lessons to relativistic perfect fluid, 
which, in combination with the energy conservation of electromagnetic field, 
completes the energy tensor. 

Writing the former edition of this book, and its expansion to this version 
was a major challenge with manifold obstacles in the way. I owed it to the 
educational values inculcated at the hallowed precincts of the University of 
Illinois at Urbana-Champaign, and to the professional ethics of my great 
teachers, to face these challenges with determination in order to “present a 
true account” of the gifts I have inherited from my Alma Mater. My books 
on Special Relativity and Mechanics are a presentation of this account in 
a humble way. With deepest reverence and respect I recall some of my 
mentors: Professor Dillon Mapother (who was instrumental in my changing 
over from civil engineering to physics), Professor James H. Smith (from 
whom I learnt Special Theory of Relativity), Professor O. Hanson, Professor 
G. C. McVittie (legendary authority in General Relativity who taught me 
an introductory course on this subject), Professor Yavin, and Professor 
Peter Axel, to name a few. 

The memory of Professor James Allen, my research advisor, is etched 
permanently on my mind. He stood by me during some of my most trying 
times. To me he was the most shining example of love and kindness. 

There were two other persons who left their indelible footprints on 
the seashore of my life: Dr. Pratap Chandra Chunder, former Minister of 
Human Resource Development, Government of India, and Dr. Rais Ahmed, 
former Director of the NCERT. 
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Fig. 1. Alma Mater Statue at the University of Illinios 


The entire typesetting of this book, up to its last details, plotting of 
graphs and drawings were done by me, under the operating system Linux 
Mint 17.2. I used 


e The document preparation system /TpX 2¢ for typesetting texts 
and equations, however difficult and complex they might appear 
before our eyes, 

e Kile 2.1 for making the typesetting and editing operations easy, 

e Gnuplot for preparing plots and graphs of mathematical equations, 
however difficult they may be, 

e Maxima for evaluating most difficult integrals, which are impossible 
to perform manually, 

e Xfig for the drawings and integrating all plotted graphs in the 
drawings, and 

e GIMP for integrating .jpg images into this book and into my pre- 
vious book Mechanics. 
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I bow my head in humble respect for those who gave their precious 
times to write these programs for the students and teachers of the world, 
and distributed them free of cost. 

I got my first tutorials in Linux, YTRX and Xfig from my first son-in-law 
Michael Murphy way back in 1995, and guidance in the use of Gnuplot and 
Maxima from my second son-in-law G. R. Santhosh. 

There were times when I was at my wit’s end, whether in understanding 
certain aspects of Relativity or in overcoming typesetting problems under 
TI4TpxX 2-. I rushed to Professor A. V. Gopala Rao, himself a relativist of 
great repute, who gave his time liberally to help me out of the difficulties. 

Professor Ashok Singal, Department of Astrophysics at Physical 
Research Lab, Ahmedabad, read my manuscript patiently and corrected 
the errors and mistakes that came to his notice. Without his watchful eyes 
some embarrassing errors would have gone into my book undetected. 

The final formatting of my manuscript to fit into the specified page size, 
has been done so meticulously by Mr. Kah-Fee Ng, Senior Editor, World 
Scientific Publishing Company. My special thanks to him. 

I shall mention a few more names: My former students Lakshmi 
Narayanan and B. Rajeswari who helped me in publishing this book; Mr. 
K. S. Venkatesh, CEO, QDP Technologies from whom I received all help 
to sort out my computer problems. 

I am grateful to Dr. H. Basavana Gowda, Cardiologist and Principal 
JSS Medical College, Mysuru, Dr. B. S. Ramesh, Radiation Oncologist 
at HCG Hospital, Bangalore, Dr. K. G. Srinivas, Medical Oncologist at 
Bharat Hospital and Institute of Oncology, Mysuru for their medical guid- 
ance during my most difficult and trying times. I am indebted to Dr. 
Shankara Narayana Jois, my Yoga Guru for guiding me to a healthier life 
and to Dr. K. V. Ravishankar of Usha Kiran Eye Hospital, Mysuru for 
helping my wife and me protect and maintain our vision as we age. 

My granddaughter Barsha Manjari Kush has been the inspiration 
behind all my creative works in physics and music. My wife Aloka, and 
my two daughters Anuradha and Madhusmita propped up my sinking spir- 
its when I lost hopes of completing my multi-dimensional projects. 

I have been enjoying Fuller Fund Membership of the American Associa- 
tion of Physics Teachers since October 2001. This allowed me access to the 
American Journal of Physics, crucial for writing papers in Classical Elec- 
trodynamics and Special Relativity. I thank Prof. Rogers Fuller, Associate 
Director of Membership, AAPT, and Harold Q and Charlotte Mae Fuller 
for this precious gift. 
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I shall conclude by citing my website 
http://sites.google.com/site/physicsforpleasure 


where the readers will find many of my physics experiments, papers and 
articles (published or unpublished), and my music projects. 

I conclude by wishing the readers an enjoyable experience in going 
through this book. 


Somnath Datta 
Mysuru, November 30, 2020 
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Chapter 1 


What Is Relativity? 


1.1. Influence of the Human Value System on the 
Evolution of Physical Theories 


The physical laws, the way we understand them, are man-made constructs 
of mathematical models, capable of discerning order and pattern in a myriad 
manifestations of the insentient, impersonal and impercipient nature. They 
are a code of conduct, believed to be obeyed by all children of nature — 
particles and objects of various hues and forms, forces and fields — in 
their mutual interactions, propagation and dynamical behaviour — as they 
weave a web of evolution through the vast expanse of space and time. It 
is not unnatural, therefore, that these codes of the impercipient world — 
being codification by the sentient human mind — should borrow from the 
ambience of human society, concepts, models, imageries and even value 
systems. 

For example, the principle of least action — which asserts that all 
dynamical processes of change in the universe are motivated by a propen- 
sity to extremize the “action integral” — which is the Lagrangian function 
summed over time — is an adaptation from human society of man’s own 
drive for optimizing the fruits of his efforts and the value for his money. 
As a corollary, it is the most sensible thing for an insentient electron or a 
missile to reach its target by that shortest route which also takes into con- 
sideration the compulsions of a prevailing ambient electric or gravitational 
field, just as it is the most sensible thing for a sentient human to reach 
his destination by that shortest route which also takes into consideration 
the various compulsions of his mission, places of special interest as well as 
lurking dangers in an unfamiliar terrain. 
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As another example, the facility with which many of us handle, and even 
enjoy, trigonometrical exercises, lends a natural instinct to extend geomet- 
rical models for unfathoming mysteries of the physical nature. And what 
else can be a simpler geometrical object than a straight line of a measured 
length and orientation — more descriptively, a directed line segment — 
popularly known as a vector? Naturally, therefore, all of classical physics 
is dominated by the vector model of various physical quantities — like 
force, velocity, momentum, electric and magnetic fields and others — each 
one of which is modelled as the segment of a straight line whose changing 
length and direction tell us all about the dynamical nature of the proto- 
type. Understanding, as well the application, of physical laws thus greatly 
simplify to the construction of straight lines and triangles, and interpreting 
them with the help of trigonometrical formulas. 

With the passage of time, human society has changed and its value 
systems have transformed, leaving their mark on the evolution of physi- 
cal theories. The emergence of democracy as a universally accepted social 
order, for instance, also contained the seeds of two powerful streams of 
physical theories that have occupied the centre stage since the turn of the 
20th century. One of them, called quantum mechanics, starts with Ein- 
stein’s interpretation of photo-electric effect in terms of light quanta and 
de Broglie’s hypothesis of wave-particle duality. In effect, they served to 
remove any fundamental distinction between the world of radiation and the 
world of matter — treating both on the same footing, endowing both with 
wave and corpuscular attributes. The other stream of thought, called the 
theory of relativity proclaims democracy of all frames of reference. It starts 
with an outright rejection of the prevailing notion of an absolute frame, just 
as democracy starts with an outright rejection of an absolute ruler. Rela- 
tivity teaches us to look upon all frames of reference as equivalent, just as 
democracy teaches us to treat different frames of opinion with equal respect. 


1.2. Rejection of Absolute Frame 


A frame of reference is a set of permanent benchmarks with respect to 
which the coordinates of all moving objects are defined. To concretize the 
idea, a frame of reference is often viewed in classical physics as a set of 
three mutually perpendicular rigid bars extending up to infinity (Fig. 1.1). 

These bars serve as the X-, Y- and Z-axes with respect to which the 
coordinates (a, y, z) of a moving object are measured. Time rates of change 
of these coordinates lead to velocity, momentum, acceleration and a host of 
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Fig. 1.1. Coordinates of a projectile. 


other derived physical quantities. Systematic observation, followed by care- 
ful study and analysis of the collected data, often leads to the speculation 
of certain types of temporal, spatial and interconnecting relations among 
such physical quantities, which are enunciated as physical laws. From its 
very nature, therefore, a law is presumed to be valid in one particular frame 
namely the one from which, or with respect to which, the observations were 
made. 

The apparent diurnal motion of stars and planets across the sky, for 
instance, led to the earlier Aristotelian speculation of a geocentric model in 
which the earth was taken to be a “fixed frame” of reference at the centre 
of the universe. In the 15th century, Nicolaus Copernicus disbanded the 
geocentric model in favour of a heliocentric universe in which the sun is the 
centre of a fixed frame of reference with the entire clan of “fixed stars” form- 
ing a backdrop of reference points against which the earth, the moon and 
the other planets and their satellites moved. From the astronomical data 
of these motions, collected by Tycho Brahe and mathematically analyzed 
by Johannes Kepler, Newton arrived at the Universal Law of Gravitation. 

Thus, at least till the end of the 19th century, man thought only in 
terms of one supreme standard of frame of reference, the so-called Absolute 
Frame (often intuitively identified with the sun and the “fixed stars” ) which 
also served as the standard of absolute rest. This also defined “motion” 
in an absolute sense, to be one in which the coordinates of an object, as 
measured in the absolute frame, are seen to be changing with time. Theory 
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of relativity brings about a complete and radical overthrow of that atti- 
tude by declaring that such a notion of Absolute Frame, absolute motion, 
absolute rest are only a myth, which can never be substantiated by any 
experiment. Relativity asserts that all motions are relative, all states of 
rest are also relative — relative to the particular frame of reference we have 
fixed for the convenience of our observation and calculations. The so-called 
Absolute Frame does not exist, because there is no special attribute, spe- 
cial quality that can be experimentally measured, and then recognized to 
be an exclusive preserve of any particular frame. The laws of physics work 
equally well in all frames of reference, provided we write them in the correct 
mathematical language. 


1.3. Relativity Principle: A Rudimentary Form 


We can explain this in the following way. Suppose, either by subjective 
judgement or by established convention, one particular frame has long been 
looked upon as the standard, which, for fixing ideas, we take as the one riv- 
eted on the surface of the earth. A team of experimenters had discovered 
the laws of physics by performing a chain of experiments in this frame 
and analyzing the table of collected data. The same team now decides to 
repeat these experiments inside a mobile laboratory set up on a train which 
is coasting smoothly along a straight line with uniform velocity. (Imagine 
the journey to be so smooth that not even a sound or jerk comes from the 
rails.) Relativity foretells that the results of this second series of experi- 
ments will be identical to those in the first, that the new set of data tables 
will replicate the older one. This implies that the same laws of physics 
are at work when observed from the ground as when observed from the 
train. This also means that there is no experimental way of discriminating 
between the two frames, or of establishing any special feature shared by 
one of them — say the ground frame — and not by the other. This being 
the case, the experimenters have no objective means of deciding which one 
of the two frames is moving in an absolute sense and which one is sta- 
tionary. They can, however, “look out”, see the trees, the mileposts, the 
hills and the rivers passing them by, and thereby conclude that their lab 
is moving relative to the ground. Alternatively, they can also believe that 
the ground — along with the hills and the trees — is moving while his 
lab is stationary. Relativity considers both these viewpoints equally valid 
and prohibits any objective judgement about who is moving and who is 
stationary, because motion is always and intrinsically relative. 
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We can now summarize the above experience as a limited relativity 
principle in the following not-so-precise form. Two frames of references that 
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are “non-rotating”* and moving with uniform velocity with respect to each 
other are equivalent and indistinguishable in all respects. This statement 


forms the core of the special theory of relativity. 


1.4. Inertial Forces 


If we take this viewpoint somewhat far, we can even elevate the discarded 
geocentric view of the universe from the dustbin of history to a status 
of equality with the heliocentric view. To be more precise, let us define 
the geocentric frame of reference as three mutually perpendicular rigid 
bars (serving as X-, Y-, Z-axes) riveted to the body of the earth and 
stretching up to infinity. The spirit of relativity would seem to suggest 
that this geocentric frame and the heliocentric one are equivalent. The 
universe revolving around the earth, and the earth revolving around the 
sun, are two apparently different, but legitimate, modes of description of 
the same schemes of nature at work. 

The reader will probably react to the above suggestions with disbelief. 
The universe with the stars and mighty galaxies all going around us! And 
to expect a modern scientific mind of the 20th century (or, 21st century) 
to place such a nonsense on par with the respectable and well-founded 
theory of the earth’s absolute rotation around its axis! Isn’t the fact that 
the earth is slightly thicker around the equator — making it look more 
like an oblate spheroid than a sphere — enough evidence that the earth is 
absolutely rotating? The necessary flattening force on the earth — known 
as the centrifugal force — comes only because the earth is turning. How 
can it originate if the earth is believed to be stationary and, instead, the 
universe is made to revolve around? 

The centrifugal force, cited in the above paragraph as an evidence of the 
earth’s rotation, is an example of a class of inertial forces which — as every 
student of mechanics knows — needs to be “invented” in order to make 
Newton’s second law of motion valid in a general accelerating frame (within 
the class of which we shall include also a rotating frame of reference). Such 
inertial forces cannot be real, because they do not originate from any mate- 
rial source (the way gravity forces and electro-magnetic forces do). Being a 


*Fix three gyroscopes with their axes perpendicular to each other. If the axes remain 
fixed in directions, then the frame of reference is non-rotating. 
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parentless, illegitimate child, an inertial force is often called a “fictitious 
force”. Anyone who analyzes the motion from a “non-accelerating” frame 
does not see any inertial force at all. 

Inertial forces are daily encountered by every commuter when the train 
or the bus he rides suddenly starts or suddenly halts. At such moments 
he feels a jerky backward thrust or a forward pull — which are examples 
of inertial forces. An observer who watches the motion from the ground 
would argue that the sudden backward or forward movement of the body 
of the commuter is not due to a real force, but due to inertia inherent in 
the commuter’s body, in conformity with Newton’s first law of motion. 

As a general rule, an inertial force —ma is “imagined” to be acting on an 
object of mass m, when viewed from a frame of reference which is moving 
with acceleration a. Anyone trying to stand, or walk on a merry-go-round 
experiences two kinds of inertial force, namely a centrifugal force and a 
Coriolis force. The first one of them is velocity independent, whereas the 
second one is strictly proportional to the velocity of the walker. These two 
forces are exactly analogous to the forces exerted on a particle carrying 
electrical charge e by an electric field E (force = eE), and a magnetic field 
B (force = ev x B, where v is the velocity) respectively. If a platform is 
rotating about an axis with an angular velocity w, then an object of mass 
m at a distance r from the axis experiences a centrifugal force directed 
outwards from the axis and having magnitude mw?r; whereas the Coriolis 
force is given by mv x 2w. A direct evidence of the existence of Coriolis 
force is provided by a Foucault pendulum, which is nothing but an ordinary 
pendulum with a rather heavy bob (so as to be relatively unaffected by air 
friction) and suspended by a very long thread from the ceiling of a very 
tall building. One such Foucault pendulum, which is in public display is 
the lobby of the UN headquarters in New York, is suspended from a 75-feet 
high ceiling. 

The plane of oscillation of this pendulum (or, in principle, any pendulum 
suspended from a fixed support in any earthly physics lab) will not be 
confined to a fixed vertical plane. Instead it will turn slowly with a period 
of rotation equal to 1/sin A days where X is the latitude of the location on 
the earth. 

Thus, we find that the slight bulging at the equatorial plane of the earth 
and the slow precession of the plane of oscillation of a Foucault pendulum 
show the existence of centrifugal and Coriolis forces at any place on the 
earth. They, in turn, provide the irrefutable evidence that the earth is 
turning around its axis. 
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There is another absurd implication of the geocentric view which the 
reader may not have missed. A galaxy which is, say, one billion light years 
away, will be orbiting a circular path of 27 billion light years in just one 
day — suggesting an incredible speed of 7302 x 10°c, where c is the speed 
of light. No physical theory will allow such a nonsense. 

The above example helps to underscore certain complexities and sub- 
tleties associated with the concept of frame of reference in the complete 
theory of relativity. It is not legitimate to think of a frame of reference 
as a non-rotating set of rigid bars stretching up to infinity. A frame of 
reference in relativity is a mathematical construct, sometimes lacking a 
complete visual picture. Two different frames of reference are related to 
each other by means of mathematical transformation equations satisfying 
certain conditions. With the choice narrowed down to legitimate frames of 
reference satisfying required conditions, the fundamental credo of relativity 
is still equal status for them all. We do not intend to pursue this argument 
further for the fear of straying away from our main objective. 

Fortunately, some of the above considerations do not fog the clarity of 
the special theory of relativity with which this book is primarily concerned. 
In this special theory, which Albert Einstein enunciated in 1905 while work- 
ing as a clerk in a patent office in Zurich, does make a distinction between 
a class of “privileged” frames called the inertial frames and the non-inertial 
ones. Without going into proper definition right now, let us accept naively 
that an inertial frame of reference is the archetype of non-accelerating, 
non-rotating frames. They are the ones in which the inertial forces are 
absent, so that Newton’s first law of motion (often called the law of inertia) 
is strictly valid. In other words, an inertial frame of reference is one with 
respect to which a point particle will continue to move along a straight line 
with uniform speed, so long as it is free from external forces. 

The relativity principle, enshrined in the special theory of relativity pro- 
claims that all inertial frames of reference are equivalent. This statement 
is a refinement of the relativity principle stated at the end of Sec. 1.3. 


1.5. Principle of Equivalence 


Even though our limited mission is an exposition of the special theory of 
relativity, it will be helpful to realize the central theme of relativity shared 
by both the general and the special theory. 

Einstein was not satisfied with the limited pronouncement of the rela- 
tivity principle in special relativity, in which the inertial frames have been 
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given a special status. In order to remove all distinctions between inertial 
and non-inertial frames, one has to appreciate the features by which one 
distinguishes a non-inertial frame from an inertial one. This feature is the 
appearance of the inertial forces in non-inertial frames, and its absence in 
the inertial frames, as already mentioned. 

Einstein himself considered the following thought experiment. Sup- 
pose, while you are inside an elevator, someone cuts the supporting cable. 
The elevator will be falling freely in the earth’s gravitational field. But so 
will you, with the same acceleration g of the elevator so that you will be 
floating inside. Seen in another way, there will be an inertial force —mg 
acting on you (see Sec. 1.4), which together with the gravitational force mg 
of the earth will result in zero force on you. As a consequence, you will be 
weightless, just like an astronaut inside his space lab orbiting around the 
earth. 

On the other hand, suppose your elevator is moved upwards with a 
constant acceleration g (so that the acceleration of the elevator is —g). 
Then you will feel twice as heavy. Because now the induced inertia force 
will be mg which, along with the existing gravitational force mg, will result 
in a total force of 2mg. 

Thus, the inertial force has the remarkable property that it can get 
mixed up with the gravitational force to cause either a total or partial 
cancellation of the same, or an enhancement, or even change in the direction 
and magnitude of the same. 

As a preamble to the General Theory of Relativity, therefore, Einstein 
proposed his famous Principle of Equivalence.© According to this principle, 
the inertial forces are equivalent to gravitational forces. One can generate 
the force of gravity of arbitrary direction and magnitude by accelerating or 
rotating his spaceship, or frame of reference suitably. One can, conversely, 
destroy the existing force of gravity at will by letting his frame of reference 
fall freely in the existing gravitational field. There is no experiment what- 
soever — either in electrodynamics, or in optics, or in any other discipline 
of physical science — which can differentiate between the effect of “true” 
gravity force (produced by the earth, or the sun) and the effect of inertial 
forces induced due to acceleration of his frame. In other words, at least the 


>See Ref. [2]. 

©Equivalence Principle, in its “weak form” and “strong form”, needs to be understood 
for a study of Relativity, in particular the General Theory of Relativity. Read what the 
masters have written on this [3-5]. 
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local effects of inertial forces are exactly the same as that of the true gravity 
forces, so much so that the inertial forces are also a kind of gravity forces. 

A trivial example of the equivalence principle is the prediction that light 
bends downwards under gravity. Consider the same elevator which has a 
hole A on its eastern wall. The elevator is accelerating with an acceleration 
g upwards in early morning, when a ray of light, progressing along a hori- 
zontal straight line, enters through A and falls on a mark B on the western 
wall. Let the time of flight of the light ray from A to B be t. In this time, 
the elevator has moved upwards by a distance s = sgt. Therefore, the 
mark B must be the same distance s below a corresponding horizontal line 
AH drawn inside the elevator. Since the accelerating elevator in gravity free 
space is equivalent to a stationary elevator under gravity, light must fall by 
the same distance in a gravitational field g. In other words, the trajectory 
of a photon deviates from its straight line path by bending towards a grav- 
itating mass. Such bending of a ray of light in a “true” gravitational field 
can be confirmed by measuring the deflection angle of a light ray when it 
grazes the periphery of the sun during a solar eclipse from its normal value. 
Experiments have confirmed this effect. 

In view of the equivalence principle, the concept of inertial frame — 
which is central in the theory of relativity — needs to be redefined. We can 
now define an inertial frame to be a frame of reference with respect to which 
all inertial and gravity forces are absent. In the vicinity of a gravitating 
mass (e.g. the earth) such a frame can be realized inside a non-rotating 
box which is falling freely in the given gravitational field. Einstein’s freely 
falling elevator and earth orbiting space-labs (which are also freely falling 
under gravity) provide best examples. 


1.6. Tidal Forces 


With the proclamation of the Equivalence Principle, gravity seems to dis- 
appear into thin air. We can create and generate the force of gravity at 
will by changing the frame of reference. Moreover, seen from a truly iner- 
tial frame, there is no gravity at all. What is then the fate of the Universal 
Law of Gravitation discovered by Newton, the triumph of which had been 
heralded by all the planets and satellites sailing in the sky? Does that also 
vanish into thin air? 

It is obvious that, since inertial forces and gravity forces are intimately 
intermixed, and since the inertial forces are off-springs of non-inertial 
frames, a truly generalized theory of relativity (i.e. the one that treats all 
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frames of reference on equal footing) must also be a theory of gravitation. 
Indeed, Einstein’s General Theory of Relativity — which he published in 
1916 — is also the most modern theory of gravitation. It was the towering 
achievement of the genius of Einstein to isolate “real” gravity (e.g. the one 
that is generated by a massive object, like the sun) from piles of “spurious” 
ones (i.e. the inertial forces posing as gravity). 

The local effects of both gravities being the same (both can be created, 
or destroyed by suitably selecting frames of reference; both have the same 
effect on material particles and light), can there be some global effects by 
which the “real” gravity can be isolated? 

Real gravity manifests itself in the phenomenon of tide, which is a global 
effect.¢ This effect can be seen in the deformation of an extended massive 
object falling freely under gravity. 

It should be easy to visualize the deformation of a spherical mass S 
of radius r which has been dropped from a height h above the surface 
of the Earth and falls vertically towards the Earth’s centre, as shown in 
Fig. 1.2(a). The particle A, being nearer to the earth than C experiences 
a larger gravitational force than C and falls faster than C. The particle B, 
being further, experiences a lesser gravitational force and falls slower. As a 
consequence C lags behind A, B lags behind C, and the diameter AB slowly 
elongates from r to r+ 0. 

The particles E and F fall with almost the same acceleration as C, along 
the radial lines EO and FO, joining to the centre of the Earth. However, 
these lines come closer as S comes closer to Earth. Therefore, the diameter 
EF contracts slowly from r to r —e. 

The net result is that a massive spherical ball which had been dropped 
from a height h above the surface of the Earth becomes an ellipsoid. An 
observer who sits inside a “rigid box” (which actually gets deformed by 
the tidal forces) shown by a rectangular frame, finds deformation of the 
originally spherical mass into an ellipsoid as the box falls from P to Q. 

If S is an entirely incoherent assembly of particles, falling toward a 
gravitating centre along an ellipse or a circle, its different parts will accel- 
erate and fall in their own ways. As a consequence they will disperse, 
fall apart, and S will hardly look like one body after some time. How- 
ever, extended objects, like comets, are not entirely incoherent, because 


dThe phenomenon of tide as the signature of real gravity is discussed by Taylor and 
Wheeler [6]. 
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deformed shape due to tidal forces 


original shape 


(a) (b) 


Fig. 1.2. Tidal deformation of a spherical object in free fall. 


the internal gravitational pull among different parts acts like a bond. But 
they become distorted. 

The same distortion will occur if the massive spherical ball had been 
a satellite of the earth, orbiting the earth in a circular orbit, as shown in 
Fig. 1.2(b). The mathematical analysis of this effect is not as simple as in 
the case of vertical free fall, but not so complicated either.® 

This distortion, due to differential accelerations of different parts 
of an extended body, is what we call tide. The regions of elongation, 
around A and B are the locations of high tide. The regions of contrac- 
tion around E and F are the locations of low tide. The real gravity 
is characterized by tidal forces. 


°See, for instance, [7]. Or, get the mathematical analysis with diagrams, in [8]. For a 
layman’s view, see [9]. 
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1.7. The Scheme of General Relativity 


Acceleration induced (spurious) gravity can be transferred away everywhere 
by suitably selecting a “global” inertial frame. “Real” gravity can be trans- 
formed away locally, but not globally. There does not exist a global inertial 
frame in the presence of a gravitating body like the sun or the earth. Per- 
haps the following examples will clarify. 

Imagine a frame of reference S freely falling under the gravitational 
pull of the earth (and at the same time going in a circular orbit) as shown 
in Fig. 1.3. From this frame, observe the motion of two balls A and B, 
both freely falling. Of the two, A is very near the origin of S, say within a 
“small” radius R (which has to be defined properly), whereas B is far away. 
In the frame S, A will be either stationary, or moving along a straight line 
with uniform velocity (at least for a limited span of time, depending on the 
initial velocity and initial location of A), whereas, B will be moving with a 
non-uniform speed. This means that the law of inertia is seen to be valid 
for over a limited region around the origin of S, and that this region, i.e. 
the radius R, shrinks smaller and smaller as time passes. The frame S is 
only locally inertial, locally with respect to both space and time. 


free-falling towards Earth 


as it orbits in a circle 


Fig. 1.3. Local inertial frame S. 
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A local inertial frame is analogous to Cartesian axes on the surface of 
a sphere. If the sphere is sufficiently large, like the earth, any city or town 
can be considered to be built on a surface which is flat locally, so that the 
map of the city can be drawn on a flat sheet of paper with Cartesian axes 
running in the W—E and S-N directions. On the other hand, it will not be 
possible to draw the map of the entire Asian continent, for instance, on a 
flat sheet of paper. 

The surface of the earth is locally flat, so that we can draw straight lines 
locally. But if we stretch two parallel straight lines too far, they will ulti- 
mately cross. In the same way, if we consider two balls, originally floating 
stationary near each other inside the “Einstein’s elevator” that is falling 
freely vertically downward, they will come closer to each other with the 
passage of time and will actually meet each other if the elevator is allowed 
to fall all the way to the centre of the earth. 

Einstein conjectured that if we consider the world line of an object 
moving in a gravitational field (world lines are trajectories of particles, 
photons, in a four-dimensional world, called space-time in which time also 
is a coordinate axis, discussed in Sec. 7.1), that world line will be the 
straightest possible path in a curved four-dimensional space-time. 

The straightest possible lines on a curved surface are called geodesics. 
Geodesics on the surface of our globe are also called great circles (e.g. the 
equator, the meridian circles). By analogy, Einstein proposed the famous 
geodesic hypothesis, according to which all freely falling objects, like the 
planets, satellites, moving in the gravitational fields of the sun or the earth, 
trace out geodesics (i.e. straightest lines) in a four-dimensional space-time. 
The path of a starlight progressing along a geodesic bends as it grazes the 
sun’s periphery, the effect discussed in Sec. 1.5. This bending of “straight 
lines” is the manifestation of curvature in the space-time. 

Coming back to the example of vertical free-fall of the particles E and 
F, alluded in Fig. 1.2(a), the geodesic lines of these two particles come 
closer to each other. This reminds us of the geodesic lines drawn on the 
surface of the earth meeting at some point. The geodesic lines that cross 
the equator perpendicularly converge at the North and South poles. This 
happens because the surface of the earth is curved. 

In the same way, the four-dimensional space-time is curved. There is 
a relative acceleration between two objects both of which are falling freely 
(the particles E and F in the above example). This relative acceleration, 
when seen in the four-dimensional space-time, constitutes the curvature of 
the space-time. “Gravitation is a manifestation of space-time curvature, 
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and that curvature shows up in the deviation of one geodesic from a nearby 
geodesic (relative acceleration of test particles).”! 

Einstein constructed the Curvature Tensor, reshaped it through math- 
ematical steps and identities into what is known as Einstein Tensor E” , 
and wrote the Field Equation of Gravitation in the esoteric form, known 


as Einstein Equation: 
81G 


py 
BP’ = a 


TH, (1.1) 


in which the source term T” on the right-hand side is the energy tensor. 
This term is a generalization of mass density p used in the Newtonian field 
equation of gravitation written in the form of the Poisson’s equation 

V7 = —4nGp. (1.2) 

Note that the energy tensor TJ” replaces mass density p, because of 
mass-energy equivalence, and Einstein tensor E“” replaces V?¢ where 
@ is the gravitational potential. The gravitational constant G = 6.67 x 
10-"! m/s? kg is the common factor in both equations. 

We have discussed tensors and energy tensor in details in this book, but 
stopped short of the grand finale of celebrating Einstein’s equation, because 
we are not yet prepared for that great journey. 

How far are the predictions of Einstein’s theory from those of Newton? 

In the non-relativistic limit, i.e. when the source of the gravitational 
field is pure stationary matter, Einstein equation (1.1) converges to the 
Newtonian equation (1.2). 

In Newtonian theory gravity is an action-at-a-distance force generated 
by massive objects, e.g. the sun. The path of a test particle is determined 
by solving Newton’s second law of motion, which is a second-order dif- 
ferential equation. The second integral of this Equation of Motion (EoM) 
predicts an elliptic path that a planet must follow under the gravitational 
force of the sun. No such force acts on a massless photon, which must follow 
a straight path. 

In Einstein’s theory, all freely falling particles (i.e. particles falling 
under gravity) including a photon, follow straightest lines, or geodesics 
in the four-dimensional space-time. The path of a planet in the gravita- 
tional field of a star, e.g. the sun, when projected on the three-dimensional 
physical space, shows up as an ellipse, as in the Newtonian case. However, 
this ellipse has a slow precession rate, i.e. its major axis turns slowly in its 
plane about the sun. 


See [5, pp. 17-18 in Chapter 1, pp. 218-219 in Chapter 8, pp. 265-271 in Chapter 11]. 
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Fig. 1.4. The world line of a planet in the gravitational field of the sun (a) Kepler’s 
orbit, (b) Schwarzschild orbit. 


We have shown the precession of a hypothetical planet and its world 
line in Fig. 1.4. The effects shown in the diagrams are highly exaggerated 
to make an impression on the reader. The elliptic orbit shown in the dia- 
gram has an eccentricity of 0.7, whereas the maximum eccentricity of orbit 
pertains to Mars, having an eccentricity of 0.093 (so that all planets move 
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in what appear to be circular orbits). The major axis of the orbit in the 
diagram shows an angular displacement of almost 45° in one revolution 
(i.e. in one planet year), compared to about 43” per century in the case of 
mercury (which has the maximum precession rate). 

Keeping these exaggerations in mind let us look at Fig. 1.4. In part 
(a), we have shown the Newtonian orbit, labelled as Kepler orbit, because 
Johannes Kepler had discovered the elliptic orbits of planets through 
detailed observations of their positions in the sky long before Newton. 
In part (b), we have labelled the precessing orbit as Schwarzschild orbit, 
because Schwarzschild solved Einstein’s equation for a spherically symmet- 
ric source, and obtained the geodesic of a planet under the new theory 
of gravitation. In parts (c) and (d), we have drawn the world lines of the 
planet moving along the said orbits. 

Without much ado let us now summarize in the following words. The 
nature of the curvature of the four-dimensional space-time is governed by 
the distribution of energy and momentum (in the case of the earth and the 
stars, the matter itself acting as energy). Einstein’s field equation, which 
establishes the relation between the distribution of energy-momentum and 
curvature, is actually equivalent to Newton’s formula of universal gravita- 
tion when the bodies producing curvature of space-time (like the earth, 
the sun) are stationary. Therefore, Einstein’s General Relativity theory 
does not invalidate Newton’s theory. It provides an alternative approach 
to gravitation through the path of geometrodynamics, an approach in which 
gravity is just a manifestation of the intrinsic geometry of space-time. 
Even though this approach has nothing in common with Newton’s uni- 
versal law of gravitation, its ultimate predictions match those of Newton in 
a non-relativistic situation. However, Einstein’s theory leads to bending of 
light, black holes and many other phenomena that Newton’s theory cannot 
foresee. 


1.8. Conclusion 


Even though the last few sections digressed into a domain that has no direct 
application in this book, they may have placed the reader in a better per- 
spective. Special Relativity hinges on the concept of inertial frames. The 
reader may have appreciated that this inertial frame is just an idealization, 
which does not exist globally. If anything, that was the one of the lessons 
conveyed by the last section. 


What Is Relativity? 19 


There is another justification. The fundamental creed of relativity is 
equivalence of frames of reference. Such equivalence beckons like a mirage 
as long as one is confined within the bounds of special relativity. To 
explain what equivalence truly means, one has to consider equivalence in 
its entirety, i.e. one has also to consider accelerating frames. However, the 
moment one considers accelerating frames, gravity enters by the back door. 
Thus, gravity and equivalence of frames are inseparable. 

The mention of relativity conjures up vision of a paradoxical world where 
space odyssey rejuvenates the youth, speeding clocks tick slowly and matter 
is transmuted into energy. How much of such mind-boggling stories truth 
and how many fiction? 

In the following chapters, we shall follow the relativity postulate to a 
logical end to seek answers to some of these questions. We shall discover 
that this innocent looking postulate (equivalence of frames of reference) 
contains seeds of epoch making consequences. One may not miss its philo- 
sophical message — that an abiding faith in certain values, when carried 
through trials and tribulations to its logical end, can lead to a world of 
miraculous discoveries. 

We quote the following lines from Professor S. Chandrasekhar [10] “It 
is an incredible fact that what the human mind, at its deepest and most 
profound, perceives as beautiful, finds its realization in external nature.” 
To which we add: What the human mind perceives as just and equitable 
finds its realization in the man-made structure of the physical laws. 
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Chapter 2 


Einstein’s Postulates, Their 
Paradoxes, and How to 
Resolve Them 


2.1. Event Point in Space-Time 


Let us start with the following example. A rocket which was fired from the 
ground exploded in the atmosphere. This explosion is an event. Events 
such as this, and more varied than this, play a central role in the concept 
structure of relativity. We shall use double quote --- double unquote to 
indicate an event. For example, we shall say that “the rocket exploded in 
the atmosphere” is an event, to be denoted by a symbol, say, “O”. 

An observer S in Delhi can pin-point the location and timing of the event 
“©” by stating that it occurred 200km west, 250km north, at a height of 
60 km and at exactly 20 hours IST. To convey this information compactly, 
we could imagine a set of X-, Y-, Z-axes whose origin is in Delhi, such 
that the X-axis is directed eastward, the Y-axis northward and the Z-axis 
vertically upward. We shall designate this set of axes, or, the reference 
frame defined by this set of axes, by the symbol S. We shall often use 
the same symbol to mean either the observer or his frame of reference. 
With respect to S, the event “O” is completely identified by specifying 
the four numbers, namely, —200, 250, 60, 20 in this particular order. (The 
first number —200 means that the x coordinate of the event is 200km 
in the negative X-direction.) These four members, when arranged in this 
particular order, are called the coordinates of “O” with respect to S. We 
write “O” = (—200, 250, 60, 20) with respect to S. 
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P y’ 


Fig. 2.1. Coordinates of a particle with respect to frames S and S’. 


Conversely, every set of four numbers, arranged in the order (x,y, z,t), 
will be considered to constitute an “event”. (There is one anomaly in the 
above coordinates, namely x,y,z have the dimension of length, whereas t 
has the dimension of time. This anomaly will be removed in Sec. 3.1 by 
multiplying t with the velocity of light, so that all the four coordinates of 
an event will have the dimension of length.) 

Now consider two frames of reference S and S$’, of which S is the Abso- 
lute Frame of reference conceived by Newton in which his laws of motion 
were assumed to be valid, and S’ is another frame of reference (the earth 
for example) which is moving in the X-direction with velocity u, as shown 
in Fig. 2.1. 

Note that we have tagged the frames by means of flags, a convention 
we shall follow in the rest of this book. We imagine a set of three mutually 
perpendicular rigid bars, serving as the frames of reference — XYZ for 
Sand X’Y'Z’ for S’. We have taken the X-axis of S, and the X’-axis of 
S’ to be parallel and directed along the velocity u of the earth. Written 
component wise, u = (u,0,0) with respect to S. 

Now imagine a particle P of mass m moving in space. Let A be a point 
on the trajectory of the particle. “The particle arrives at A” is an event. 
We call it the event “O,4”. The coordinates of this event are as follows: 


(x, Y,4, t) in S, 
“O04” = 
(2’,y’,2’,t’) in S’. 
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For the configuration shown in Fig. 2.1, the coordinates of the origin O’ 
of $’, with respect to S, are Ro = (Xo, Yo, Zo) at t = 0, and R = (a, + 
ut, Yo, Zo) = Ro + ut at time t. It is obvious that 


vg =L- 2 — ut, 


—_ i 
¥y —_ y Yo; (2.1) 
zZ)=Z-Zo, 


aa i 
Written compactly, 


r =r—R=r—Ro-wW, 
(2.2) 
Y= 4. 
Equation (2.1) and its equivalent form (2.2) constitute the familiar 
Galilean Transformation (GT) with the inclusion of the time coordinate. 


2.2. Inertial Frames 
2.2.1. Newton’s equation of motion 


Much of physics is based on mechanics and it deals with motion of objects. 
The physical quantities associated with motion are displacement, velocity, 
acceleration, etc. These quantities must be specified with respect to some 
reference frame. When we say that the velocity of a steamer is 25 km/hr, 
this implies that the steamer recedes from a point fixed on the bank of 
the river at the rate of 25km/hr. Here, the river bank constitutes a ref- 
erence frame. We can imagine a boat on the river sailing in the direc- 
tion of the steamer at a speed of 10km/hr. This boat constitutes another 
frame. The velocity of the steamer with respect to this second frame will 
be 15km/hr. Therefore, it is meaningless to talk about the laws of motion 
without having fixed before our mind a particular reference frame. What 
reference frame, then, did Newton fix before his mind when he enunciated 
the laws of motion? 

Newton presumed the existence of some Absolute Frame, as mentioned 
in the Sec. 1.2. He identified it with the frame of the ‘fixed stars’. In our 
discussion, we shall tentatively identify this so-called Absolute Frame (AF) 
as one fixed with respect to the sun. Now let us state the first two laws 
of motion, which, by assumption, are valid in this AF. The first law, also 
called the law of inertia, states that the acceleration a of a particle is zero 
in the absence of an external force. 
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The second law states that a is not zero if the particle is acted on by 
an external force F, in which case F equals mass times the acceleration of 
the particle: 


F = ma. (2.3) 


Another Newtonian assumption is that the mass m of the particle is abso- 
lute. It does not change with the velocity of the particle and it is the same 
to all observers. 

Should the above two equations be valid on the earth which is moving 
with respect to the Sun? 

Strictly speaking, the answer is ‘no’. Not because the earth is just mov- 
ing, but because the earth is rotating about its own axis and is also going 
around the sun in a circular motion. However, for the time being let us 
ignore the spinning motion and the orbital motion around the sun, the 
effects of which are relatively small. Let us assume, for simplicity, that the 
earth is moving, without spinning, along a straight line with a uniform 
speed of 30km/s with respect to the sun. 

Let the velocity of the particle P be v with respect to S, and v’ with 
respect to E. Component wise, 


v= dr = (F dy =) with respect to S, 


dt ‘dt’ dt’ dt 
de! (ae? dh ae oa 
whe a (= ae =) with respect to S’. 


Note from Eq. (2.1) that dt’ = dt, dx’ = dx — udt, dy’ = dy, dz’ = dz. 
Therefore, 


Px dz —udt dy dz 
7 dt 7 dt’ dt)’ 


(2.5) 


which is the transformation equation for velocity. 
Let a and a’ denote the acceleration of P with respect to S and &, 
respectively. Then 


ss dv 
~ dt’ 

,_ dv’ _d(v—u)_ dv 
~ dt dt dt 


(2.6) 
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since u is a constant vector. We notice that even though velocity transforms 
under the GT, acceleration remains invariant. 

By our assumption, Newton’s second law of motion, as given by Eq. (2.3) 
is exactly valid in the Absolute Frame S. Also, the measure of the external 
force, as for example determined by the reading on a spring balance, should 
be the same in E as in S. Hence, from Eq. (2.6) 


F = ma’. (a5) 


Thus, Newton’s second law of motion is valid in F, if it is valid in S. 

As a special case let F = 0. This would imply a = 0, as well as a’ = 0. 
This is Newton’s first law of motion, which is therefore valid in E, if it is 
valid in S. 

In summary, if the £ frame is imagined to be a non-rotating frame mov- 
ing with uniform velocity with respect to the S frame, then Newton’s first 
and second laws of motion — which are postulated to hold in S — also 
hold in E. These laws are valid in any frame S$” with similar properties (i.e. 
non-rotating and moving with uniform velocity with respect to the AF). 
By induction, since these laws are valid in S’, they are valid in any other 
non-rotating frame S$” moving with uniform velocity with respect to S’. 
By this process, we obtain an infinity of frames of reference which are 
non-rotating and moving with uniform velocities with respect to one 
another, and with respect to the AF — and Newton’s laws of motion are 
valid in them all. 

A frame of reference in which the law of inertia (i.e. Newton’s first law 
of motion) holds is called an inertial frame (IF) — as we have discussed at 
some length in the previous chapter. We recognize that there is an infinite 
number of IFs. Let S' be any one IF (not necessarily the AF) and let S’ be 
another. If, at t = 0, the origin of S’ is at the coordinates rp = (Zo, Yo, Zo) 
and moving with velocity u with respect to S, then the Galilean transfor- 
mation from S$ to S’ gives us the following relations: 


r=r-—ro-—ut, 


=, 

; (2.8) 
v=v-u, 
a’ =a. 


All IF's will measure the same acceleration of a moving object. 
In the above we have considered only Newton’s first and second law of 
motion. What about the third law? The third law of motion is a corollary of 
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the law of conservation of momentum. We shall now discuss the invariance 
of the laws of conservation of momentum and energy under GT. 


2.2.2. Conservation of energy and momentum 


Conservation of energy and momentum are among the foundational princi- 
ples of physics. We shall show that these laws are valid in all inertial frames 
by examining a two-body collision. 

Figure 2.2 shows a view of two particles A and B engaging in a collision, 
as viewed from some inertial frame S. The collision — a term which is often 
used to mean a passing interaction between two particles — may result in 
the creation of new particles after the original ones have encountered each 
other. Therefore, for the sake of generality, we are considering two different 
particles C and D emerging from the scene of collision. 

In Newtonian physics mass is conserved, so that 


matmp=mo+mp. (2.9) 


Let us postulate that the total momentum and the total kinetic energy of 
an isolated system (i.e. a system which is not influenced by anything from 
outside) be each conserved in an elastic collision, when viewed from an 
inertial frame S. This gives the following two relations: 


(i) Conservation of Momentum: 
MAVA +MBVB=™MoVC +MpvVp. (2.10) 


(ii) Conservation of kinetic energy: 


1 1 1 
Zara + ZMBUB — xmove + s™mpup: (2.11) 
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Fig. 2.2. Collision of two particles. 
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We shall now use Eq. (2.8) to rewrite the above equations in terms 
of velocities, as measured in the frame S’. Letting v4, v‘g, VG, Vp be the 
velocities of the respective particles in the frame 5”, we have 


ma(v', +u) + ma(vp tu) = mo(vo + u) + mp(vp + u). 
Using Eq. (2.9), the above equation reduces to the following form: 
mavis +MBv_, =MoVo+MpVp- (2.12) 


This shows that the total momentum, as measured in the frame S’, is the 
same after a collision, as it is before the collision. We now convert Eq. (2.11) 
along parallel lines: 


1 1 1 1 
gma(via tu)” + 5ma(vp tu)’ = 5mo(ve + uy’ + 5mp(v +4)”, 


1.e. 
1 


1 1 
5mav'a + 5B p +u.(mav', +mpv'p) + zu (ma + mp) 


1 1 1 
= 5mev'c + smu p +u.(mov'c +mpv'p)+ zu (me +mp). 


Using (2.9) and (2.12) the above equation reduces to the desired form: 


rae + Sine, = Sie + apts (2.13) 
2 2 2 2 

Equation (2.13) now validates conservation of the kinetic energy in the 

frame 8S’. 

In summary, if the laws of conservation of linear momentum and kinetic 
energy are established in one inertial frame, then they are established in 
all inertial frames, as a consequence of the Galilean transformation. We 
can extend the same principle to other physical quantities, like angular 
momentum and total energy (i.e. the sum of the kinetic energy and the 
potential energy) and establish that they are valid in all inertial frames. 


2.2.3. Equivalence of inertial frames 


The conclusions just reached in the previous sections lead to equal status 
of all inertial frames — at least within the limited scope in which we have 
examined them. They clearly tell us that all inertial frames are equivalent 
with respect to the experiments and laws of mechanics. A coin tossed in- 
side a smoothly cruising jet plane will fall exactly in the same way as it will 
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fall on the ground. Considered from a more general perspective, every IF 
has all the properties that characterize the hypothetical AF. 

This is the Newtonian principle of relativity. In essence it says that 
identical mechanical experiments performed in different inertial frames will 
yield identical results. The laws of conservation of mechanical momentum, 
mechanical energy and the laws of motion which are presumed to be valid 
in the AF are seen to be valid in all IFs, as a consequence of the GT. 
Therefore, there is no experiment, at least in the domain of mechanics, by 
which this hypothetical AF, even if it exists, can be identified. 


2.3. Historical Background 
2.3.1. Search for the absolute frame 


If it is impossible to tell, by any means whatsoever, which frame is AF and 
which frame is not, then why should there be any reason for hypothesizing 
an AF at all? 

With the formulation of the laws of electricity and magnetism, how- 
ever, the need for this AF became evident. Clerk Maxwell rationalized the 
phenomena of electricity and magnetism into a set of equations known as 
Maxwell’s equation. Maxwell’s equations lead to wave equations for the 
electric and magnetic fields, showing a characteristic wave speed c which, 
in vacuum, equals 3 x 10° m/s, the same as the speed of light. This means 
that if you change the charge-current configuration somewhere in space, 
the electric and magnetic fields will change everywhere in space, but the 
field will change earlier in the near region and later in the far region. The 
messenger that carries the command for change from a near region to a 
far region is the electromagnetic wave and it propagates with the speed of 
light, somewhat in the same manner that ripples propagate the informa- 
tion of a disturbance on the surface of a pond with a much slower velocity. 
Radio waves, visible light, X-rays are all such electromagnetic waves, lying 
in different band zones of the frequency spectrum. 

Before the formulation of the theory of relativity it was generally 
believed by physicists that electromagnetic waves were similar to mechan- 
ical waves — like sound, seismic waves, ripples on the surface of a pond. 
Each of these examples is associated with a medium that carries the wave. 
A disturbance in the mechanical configuration occurs somewhere in the 
medium, and this information is sent outwards by the elastic properties of 
the medium in the form of a wave. Water surface is the medium for ripples, 
air for sound, earth for seismic waves. 
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It was generally believed that the “mechanical medium” that carries the 
disturbance called electromagnetic wave, or light, is aether. Many physi- 
cists, including Maxwell himself, dabbled with the hypothetical proper- 
ties of aether. Aether was thought to be a fluid that pervaded all space, 
penetrated all materials, had some extraordinary properties, like perfect 
elasticity (so that no energy is extracted out of light when it propagates 
through it) and extremely high modulus of rigidity (so that light waves, 
oscillating at very high frequencies, of the order of 10!° Hz, could propagate 
through it). 

One now gets a clue for identifying the AF. There must be some frame of 
reference, say Sg, in which Maxwell’s equations are valid exactly. However, 
if they are valid exactly in So, then they cannot be exactly valid in some 
other frame S$, because any GT applied to these equations will destroy their 
forms (equations of electrodynamics involve first-order derivatives whereas 
those of Newton involve second-order derivatives like tr), Therefore, peo- 
ple were inclined to believe that all inertial frames were not equal, that 
among all the IFs there did exist one privileged frame, which alone was 
entitled to claim the equations of electrodynamics, and had, therefore, an 
absolute character. That frame of reference must be the long-cherished AF. 

It was therefore speculated that aether was at rest in the AF, and light, 
which propagates in aether, must have an absolute speed c in all directions 
with respect to this aether. 

How to identify this AF? The answer should not be difficult even for a 
lay reader. In any other IF, which is moving, say in the X-direction with 
velocity u with respect to the AF, the speed of light in the +X direction 
will be c— u, and in the —X-direction will be c+ u. The velocity of light 
in different directions should in fact be different in this new IF. Among all 
the IFs there is one, and only one, IF in which the speed of light is same 
in all directions and that frame alone is to be identified as the AF. 

An experiment can be devised to measure the difference in the velocities 
of light in two different directions on the surface of the earth. This will give 
us immediate information about the velocity of the earth relative to the 
AF. A large number of ingenious experiments, of which the Michelson and 
Morley’s experiment is most well known, have been performed to measure 
this difference. Contrary to everybody’s expectations no difference has ever 
been found. All experiments on the velocity of light have unmistakably 
shown that light propagates with the same constant speed c in all directions 
in vacuum, at all times and in all seasons, in the frame of reference of the 
earth. 
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The Michelson—Morley experiment attempts to determine the velocity 
of the earth relative to aether. Alternatively, since aether is assumed to 
be at rest in the Absolute Frame, the outcome of the experiment should 
determine the velocity v of the earth relative to the AF. It was found from 
this experiment that v is zero. 

In summary, it will be sufficient to say that the theory of relativity is 
founded on the premise that there is no aether and no Absolute Frame. 

We shall now give a brief account of the Michelson—Morley experiment. 


2.3.2. Michelson—Morley experiment 


The Michelson—Morley experiment (to be abbreviated as MM experiment 
in the following) utilizes Michelson interferometer. Figure 2.3(a) describes 
the basic set up of the apparatus. Here S is a source of a monochromatic 
light. A collimator takes a parallel beam out of this source. This beam is 
split into two components ¢; and ¢2 by the partially silvered mirror A. The 
component $1, which is transmitted through A towards B, gets reflected 
back at the mirror B, and then comes back to A. The other component ¢2 
is reflected upwards at A, goes to the mirror C, and is reflected back to 
A. These two beams, after return to A, get partially reflected again and 
partially transmitted again at A, so that a fraction of each one of them, say 
half of ¢; and half of 2, will now recombine and proceed along the path 


Mirror aS / pied 


vD 
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Fig. 2.3. Schematic arrangement of Michelson—Morley experiment. 
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AD as a single beam. This recombination of two fractions of what used 
to be a single beam earlier, after they have travelled through two different 
path lengths, causes what is known as “optical interference”. The lens will 
focus these interfering beams onto a screen. 

Suppose the earth is moving, with respect to aether, with velocity v in 
the direction of the line AB, which we take as the X-axis. This means that 
aether is moving in the negative X-direction with speed v with respect to 
the earth. Since light travels in aether with speed c, the speed of light in 
the Lab frame as it travels along the +X-axis will be c — v, and along the 
—X-axis will be c+v. If T, is the time required for the beam ¢; to go from 
A to B and then travel back from B to A, then 


i i OL v2)! aL v 
—_ + =— sie > Bit, 
c—Uv c+uvu c Cc C Cc 


Now consider the second beam ¢2 following the path ACA, which it covers 
in time 75. To an observer at rest in aether, this beam must follow the 
slanted path AC’A”, shown in Fig. 2.3(b), in order for it to reach the 
mirror A which has moved to the point A” during the same time 7). If v 
is the velocity of aether with reference to the lab and c is the velocity of 
light with reference to aether along AC’, then the velocity of the beam ¢2 
in the Lab frame along the upward path AC is the vector sum c+ v. The 
magnitude of ¢ + v is 


In the same way, one computes the speed of the beam in the Lab frame 
down the path CA to be 


The time taken by the beam ¢2 to cover the path ACA is therefore 


bd OE Be oF, Lv’? 
moby bah 2PM WE Ay 
Uy U2 Cc Cc Cc 2c 


Therefore, there is a time difference 


Lv? 
of =T, -Ih=— 
c 
between the times of arrival of the two beams, when they recombine at A to 
cause an interference fringe pattern. The order of the fringe at the centre 
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of the screen is 


coT Lv? 

dh Ae?’ 

where A is the wavelength of the light beam used. As the earth goes round 
the sun in a circular orbit, its velocity relative to the AF keeps changing 
direction. Six months later, the direction of the velocity of the earth, which 
was in the +X-direction at the start of the experiment, will now change 
into the +Y-direction, so that the aether wind will now blow in the —Y- 
direction in the reference frame of the earth. The roles of the paths ABA 
and ACA in the above experiment will now get interchanged, so that now 


wae 2L 2 
T, = time taken to cover ABA = = 1+->]|, 


20 7 
T; = time taken to cover ACA = — E + | 
C c 


Hence, the new fringe order at the centre of the screen will be 
n' =—n. 
Therefore, in six months the interference pattern will shift through 


2QLv? 
Ac? 


N=n-n =2n= fringes. 


In the actual experiment no such fringe shift was observed. 


2.4. Postulates of Special Relativity 


The null result of the MM experiment demolishes the notion of Absolute 
Frame. Special Theory of Relativity therefore starts with the premise that 
all IFs are equal — not merely with respect to the laws of mechanics, 
but for the whole of physics. We have, however, seen that the laws of 
electrodynamics do not appear to be the same in all IF's. The answer to 
this paradox lies in recognizing the fallacy of the Galilean Transformation. 
Before showing how the paradox is resolved satisfactorily, it will be desirable 
to enunciate the two fundamental postulates of Special Relativity on which 
its entire concept structure rests. 


Postulate 1 (Relativity Postulate). All inertial frames are equivalent 
in the sense that identical experiments performed in different inertial frames 
will yield identical results. 
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We mentioned earlier Maxwell’s equations lead to a characteristic speed c of 
light (by light we shall mean any electromagnetic wave). These equations 
also show that light emitted by a moving source (for example, every charged 
particle under acceleration radiates light) propagates in all directions with 
equal speed c, and c is independent of the velocity v of the source. This is 
a key notion on which further progress of our theory depends. Hence the 
following postulate. 


Postulate 2 (Source-independence of the speed of light). Light 
propagates without any medium with a speed c whose value is same in all 
directions, and is independent of the velocity of the light-emitting source. 


By combining the above two postulates we get a very important corollary. 
Consider a light emitting source LZ which is moving with respect to a frame 
S. There is some comoving frame So in which L is at rest, and light emitted 
by it propagates at speed c. Since propagation speed does not depend on 
the velocity of L (postulate 2), and since this characteristic speed should be 
same in all frames of reference (postulate 1), propagation speed c in So (in 
which L is at rest) should be same as in S' (in which L is moving). Hence 
the following corollary. 


Corollary 2A. The speed of propagation of light is given by c = 3x 10° m/s 
in all inertial frames, independent of the motion of the light emitting source. 


2.5. Relativity of Simultaneity 


Let us imagine two inertial frames S and S’. S” is moving relative to S 
with velocity u in the direction of the X-axis, which is taken parallel to the 
X'-axis of S’ (Fig. 2.4(a)). Times ¢ in S and t’ in S’ are measured from the 
instant when the origins O and O’ of the two frames just pass each other. 
At that very moment a sharp flash of light is emitted from a source L which 
is fixed to the origin of S’, so that L is stationary in S’, but moving with 
speed u in S. According to Corollary 2A, light will be propagating radially 
with speed c with respect to S, as well as S’. This means that a spherical 
wavefront , diverging from the origin O of S with speed c and having 
radius ct, will contain this light flash at the instant t. Similarly, another 
spherical wavefront X’, diverging from the origin O’ of S’ with speed c and 
having radius ct’ at the instant ¢’, will contain this same light flash. If the 
clocks of the observers S and S’ are assumed to tick at the same rate, then 
at the instant ¢ = t’, the same flash of light is simultaneously contained 
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Fig. 2.4. Simultaneity paradox. 


in two different wavefronts © and =’ which have the same radius ct = ct’. 
This is absurd. What has gone wrong? 

We went wrong by believing in one universal time which, as we shall 
find, does not fit with the postulates of relativity. Consider the same light 
flash as discussed in the previous paragraph. Imagine two points P and Q 
on a sphere of radius ct in frame S (Fig. 2.4(b)). Two events, e.g. “light 
reaches P” and “light reaches Q” which we designate as “Op” and “Og” 
respectively, both occur at the same time t and are, therefore, simultaneous 
in S. Their coordinates are 


“Op” = (21,41, 21,1) : 
: in S. 
“Og = (x2, Y2, 22,t) 


However, in this time t, O’ has been displaced by the distance ut to the 
right of O. Consequently, P and Q no longer lie on the same sphere with 
centre at O! (Fig. 2.4(c)). P will be nearer to O’ than Q. 

Therefore, the event “Op” occurs earlier than the event “Og”, according 
to S’. The coordinates of the same events with reference to 5S” will be 


“Op” = (71,41, 21,4) : , 
“ ” Pogd of gt ns : 

OQ” = (&2, 92, 22, ¢2) 
and t, < th. We therefore see that the relativity principles are in contradic- 
tion with the Newtonian concept of universal time. Time assumes a relative 


character in relativity. In particular, we have just discovered the following 
important rule. 


Rule 1. Two events which are simultaneous with respect to an observer S 
cannot be simultaneous with respect to another observer S' who is moving 
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relative to S (unless the direction of motion happens to be perpendicular to 
the straight line joining the spatial locations of the events). 


One can advance arguments to show similar discrepancy with respect to 
distance measurement also. Time and distance are both divested of abso- 
luteness in relativity. They give different measures to different observers 
who are moving with respect to each another. 

Relativity rejects many of the intuitive notions of Newtonian physics. 
One of the first casualties is Galilean transformation, as the following exer- 
cise will illustrate. 

Consider an event “®” namely “reception of the light flash at some 
point P”. Let the coordinates of this event be (x,y, z,t) in the frame S. 
This means that the light ray has covered a distance \/a? + y? + 2? in 
time t, as measured in S. Therefore, 


zg? +y? +27 —c7? =0. (2.14) 


This is an equation of a family of concentric spheres, representing wave- 
fronts W,, W2, W3,..., corresponding to different times t,, to, t3,..., as seen 
from S, and as shown in Fig. 2.5(a). In order to obtain the equations of 
these wavefronts in the frame S’, we shall have to transform the above 
equation with the help of Eq. (2.1). This gives the desired equation: 


(a! + ut)? + y!? + 2 = ct”. (2.15) 


(a) (b) 


Fig. 2.5. Wavefront series seen from S and 9’. 
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Equation (2.15) describes a series of spheres Aj, Az, A3,... whose cen- 
tres C1,C2,C3,... are located along the negative X-axis, as shown in 
Fig. 2.5(b). However, Corollary 2A requires that both S and S$’ should see 
concentric spherical wavefronts with centres fixed at their respective ori- 
gins. Therefore, a new set of transformation equations between (x, y, 2, t) 
and (2’,y’,2’,t’) is required to replace the GT. This new transformation 
should satisfy the requirement that if the coordinates of the event “6” with 
respect to S satisfy (2.14), then the coordinates of the same event “®” with 
respect to 5” must satisfy a similar equation, namely 


al? + y/? + 2! — Pe? = 0, (2.16) 


As a prelude to the new transformation rule (to be called Lorentz trans- 
formation), we shall consider two paradoxical and important consequences 
of our postulates, namely time dilation and length contraction, in the fol- 
lowing sections. 


2.6. Time Dilation 


Imagine the Michelson interference experiment being performed on a train 
which is moving with velocity v with respect to the platform as shown in 
Fig. 2.6(a). Let “a” represent the event that the ray ¢2 — after coming 
from the source S — is “reflected upwards at the mirror A”. Let “8” repre- 
sent the event that this ray — after bouncing downward from the overhead 
mirror — is “received back at the mirror A”. 


“vr/2 | vT/2_ 
Light ray 6, seen from frame $ 


(b) 


Fig. 2.6. Coordinates of a particle with respect to frames S and E. 
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An observer sitting in the train — call him Mr P (P for passenger) — 
marks the path followed by this ray. We shall denote his frame of refer- 
ence — i.e. the frame fixed on the floor of the train — by the same letter 
P. Let the length of the arm AC, as measured in the frame P, be Ly. Since 
light propagates with speed c in P, the time of flight Jo of the ray @2 in 
covering the round trip ACA, and as measured in the frame P, is 


ery 
=—*. 


7 ay) 


Another observer Mr S$ (S for Station Master), standing on the railway 
station, watches the same pair of events “a” and “8”. He finds them taking 
place at two different points A and A” on the track (Fig. 2.6(b)). Light 
takes a longer route AC’A” in the frame S, and therefore, travelling with 
the same speed c (according to Corollary 2A), must take a longer time T 
(as measured in S$) in covering this route. Assuming that the length of the 
vertical arm of the MM apparatus (i.e. the perpendicular distance between 
C’ and the line AA”) measures out to be the same length Lo in S as in P 
(we shall justify this statement later), the length of the path traversed by 


light, as seen from the frame S, is 2,/L2 + {2 y, which should now equal 


cl’. Therefore, 
2 vT )? 
Tse 4 > . a4 
(H+ {>} (2.18) 


Using Eqs. (2.17) and (2.18), one obtains a relation between T and To: 


so that 


y2 
Ty = 4/1 - —T. (2.19) 
C 


Note that To and T are the time intervals, as measured in P and S respec- 
tively, between the same pair of events “a” and “8”. 

The reader may conclude from the above discussions that “moving 
clocks go slow”. Mr P is moving and Mr S is stationary! Therefore, Mr P’s 
clock registers smaller time than Mr S’s clock. However, such arguments 
are wrong and contradict the very spirit of the relativity principle. 

The fallacy in the above statement lies in that if S can claim that P’s 
clock is going slow because P is moving (relative to S), then P can also claim 
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that S’s clock is going slow because S is moving (relative to P). As we have 
stressed in Chapter 1, motion is relative. And though it may sound strange, 
relativity postulates find both P and §S to be correct. Each one’s clock is 
going slow with respect to the other. 

Let us resolve the paradox implied in the last sentence. We define proper 
time between any two events to be the time interval between these events, 
as measured in one particular frame of reference So in which both the events 
happen to take place at the same spatial location. What we mean by this 
is that if two events have coordinates (11, y1,21,t1) and (2, ya, 22, t2) in 
So, such that 7] = 22,y1 = yo,21 = 22, then we call Tp = tz — t; the 
proper time between the events. It should be remembered, however, that 
for a particular pair of events there may or may not exist a frame in which 
both the events will occur at the same location, and hence, there may or 
may not exist a “proper time” between these events. 

In the above example, the events “a” and “8” occur at the same floor 
location A in P’s frame of reference, whereas they take place at two different 
locations, namely A and A”, along the railway track, as seen from S’s 
frame of reference. Therefore, P measures proper time between “a” and 
“6B” whereas S measures “improper” time. Please note that we are using 
the terms “proper” and “improper” not to mean right and wrong. Proper 
time is just a nomenclature, a definition. “Improper time” is any time 
interval that does not satisfy that definition. It has been assumed that both 
observers P and S are using standard clocks for measuring time intervals, 
and, therefore, both measurements, i.e. “proper time” and “improper time”, 
are correct measurements, but with respect to different observers. It should 
be understood that the term “proper time” has no meaning except with 
reference to a particular pair of events. 

The correct conclusion from the result of the above calculation is that 
the proper time interval Tj between a given pair of events “a” and “S” 
is always less than the corresponding “improper” time interval T between 
them. 

It will be convenient to introduce at this stage the Lorentz factor, 7 
which we define as follows: 


1 
————,_ where B= . (2.20) 


VF 


Since the Lorentz factor y defined above is associated with the operation 
boost (see the meaning in Sec. 3.1), we shall in future call it boost Lorentz 
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factor, in order to distinguish it from the dynamic Lorentz factor T to be 
introduced later through Eq. (4.15), in Chapter 4 
Note that 


Ae A (2.21) 


and the following identities which can be proved easily: 


eo 2 Q2 1 2 i 2 2 2 Q2 
=70", 1-z=86, poe) ales p. 


1- 6p? ¥ 
(2.22) 


Using the Lorentz factor, we shall summarize the time dilation formula 
(2.19) in the form of the following very important rule: 


Rule 2. Let there be a frame of reference So with respect to which two events 
“a” and “8B” occur at the same spatial location. Let S be another frame of 
reference which is moving with uniform velocity v with respect to So. If To 
and T be the time intervals, as measured in So and S,, respectively, between 


“a” and “8” (so that To is the proper time between the events), then 


1 
To = V1—(?T = -T. (2.23) 
oy 


2.7. Length Contraction 


In a similar vein, we can define proper length to be the length of an object 
in its rest frame, i.e. the length measured in that particular frame in which 
it is at rest. 

Let us once again examine the MM experiment being conducted on the 
train as was shown in Fig. 2.6. This apparatus is stationary in the frame of 
reference of the train. Therefore, the length Lo of the arms AB and AC, 
when measured with meter sticks laid on the floor of the train (or held 
stationary on board the train), are the proper lengths of these arms. The 
same lengths measured by meter sticks laid on the ground may be called 
“improper” lengths of these segments. Instead of using meter sticks, an 
alternative and better means of the length measurement will be with the 
help of light beams, which we shall now employ for conceptualizing the 
length paradox. 

Let us, therefore, assume that both P and S measure the length of 
the arm AB using the time of flight of a light ray from A to B and then 
back from B to A. Let these measurements be Lo to Mr P, and L to Mr S. 
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We shall establish a relationship between Lp and L with the help of relation- 
ship (2.19) between proper time and “improper” time. For this purpose, 
we shall identify three significant events along the journey route ABA of 
the ray of light. These events are: 


e “O41” = “the light ray passes through the half silvered plate A” (on its 
way towards the mirror B). 

e “Op” = “the ray is reflected back at the mirror B”. 

e “Oy.” = “the ray returns to the half silvered plate A” (after being 
reflected at the mirror B). 


As light travels with velocity c in the reference frame of the train, Mr 
P measures the same propagation time Tj for the light ray to go from A to 
B, i.e. between the events “O4;” and “Og”, and also to return from B to 
A, i.e. between the events “Op” and “O42”. The total time of the round 
trip flight is then To = 27]. The total length travelled during this time is 
2Lo. Therefore, 2L9 = 2cT{ = cTp. Hence, 
clo 
aa 

Mr S observes the same phenomena from his own reference frame, the 
platform. According to his watch, the light ray takes time 7) for its forward 
* and “Op,” during which time the mir- 
rors move from the locations A and B to A’ and B’, and a different time T> 
for the return trip (i.e. between the events “Og” and “O49” during which 
time the mirrors move from A’ and B’ to A” and B”), as suggested in 
Fig. 2.7. 

It should be clear from these diagrams that cT, = [+ vT,, and cT) = 
L — vT»2, so that T; = 4; Th = 


c—U ctu" 


Lo = (2.24) 


trip, i.e. between the events “O 4)’ 


Fig. 2.7. The ray ¢1 traced by Mr S. 
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The total time for the round trip according to Mr P is then T = 7,+7) = 


2Le 
c — Vv? cT v2 
Lh= T=—|1-—). 2.25 


a>. Therefore, 

Note that To, appearing in Eq. (2.24), is the “proper time”, and T 
appearing in Eq. (2.25) is an “improper time”, between the events “O41” 
and “Q 42”. Connecting Eqs. (2.25) and (2.24) with the help of Eq. (2.23) 
one now obtains in a straightforward way the required relationship between 
L and Lo, which we have written below as Eq. (2.26) at the conclusion of 
the following important rule. 


Rule 3. Let Lo be the proper length of a rod (as measured in its rest frame 
So). Let the rod be moving longitudinally with velocity v with respect to 
another frame S (i.e. v is parallel to the axis of the rod), and let the 
(improper) length L of the rod, as measured in the frame S, be L. The 
relation between the two is given by the formula: 


Io = ————=L = 4. (2.26) 


Compare this with the relation between the proper time and improper 
time as given in Eq. (2.23) 

Since y > 1, it is seen from Eq. (2.26) that Mr S measures a smaller 
value for the longitudinal dimension (i.e. the dimension which is parallel 
to its direction of motion) than Mr P. He therefore concludes that when an 
object moves, its longitudinal dimension contracts. 

It will be in order to suggest an operational model of length measure- 
ment both for subsequent reference as well as for elucidation of the meaning 
of Eq. (2.26). The length of a moving stick can be measured by taking its 
shadow-graph on a photographic plate laid on the floor (or on a table) 
of the laboratory as the stick shoots past it. We have illustrated this in 
Fig. 2.8(a). An overhead array of flashguns has been provided for casting 
shadow on the plate. The end guns G; and G2 are triggered simultane- 
ously, so that the shadows P, and P2 of the endpoints A; and Ag of the 
stick are also etched simultaneously on the plate. (Note that these etching 
events are simultaneous in the Lab frame only.) The distance L between 
the marks P; and P2 (which are permanent marks on the table) measured 
subsequently using any standard meter stick will give the laboratory mea- 
sure of the length of the moving stick. It is this LZ which is related to the 
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Fig. 2.8. Measurement of longitudinal length. 


“proper length” Lo of the stick through Eq. (2.26). (We have obtained this 
relationship using Lorentz transformation in Sec. 3.3). 

We shall adopt the following mathematical description for the outcome 
of the above length measurement experiment. 


Theorem 2.1. Let R and Ro be two straight rods sliding past each other 
longitudinally with a relative speed cB. If the segment P,P2 of R coincides 
with the segment Ay A> of Ro, when viewed from R (so that R is stationary 
and Ro is seen to be moving), and if the proper lengths of PyP2 and A, Az 
are L and Lo, respectively, then 


1 
L=-—Lo. (2.27) 
" 


A popular way of saying the same thing is: a rod of proper length Lo 
shrinks to a smaller length L = =Lo, when moving with velocity Bc. We 
have portrayed this concept graphically in Fig. 2.8(b). 

Theorem 2.1 will play a very important role as the mathematical model 
of length measurement in the subsequent development of the formalism. 

In Sec. 3.6, in Chapter 3 we have reinforced the concepts of simultane- 
ity, time dilation, length contraction through Theorems 3.1-3.3. We have 
shown through actual calculations: 


e the exact time difference between two events in a second frame, when the 
same events are simultaneous in the first one; 
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e the relation between the two different lengths ¢’ and ¢” measured in a 
given frame of reference, when they match the same length € measured 
in another frame of reference. 

e the relation between time intervals T/ and T} in a given frame of reference 
when the corresponding time intervals are the same in another one. 


The reader should understand the logic presented there in order to get 
a better grip of these exotic concepts. 

We shall now take up transverse length and suggest a reason why the 
transverse dimension of an object (i.e. the dimension measured perpendic- 
ular to the direction of its motion) should not change due to the motion. 

Imagine two identical and parallel rods L and R, each equipped with 
markers at their ends (Fig. 2.9). Let A and B be the markers of L, and C 
and D the markers of R. When at rest, the marker A coincides with C, and 
the marker B with D, thereby confirming that the proper lengths of the 
rods are equal. Now let the rods move transversely towards each other. As 
they zip past each other, the markers mark the opposite rods. If the length 
of the rod L, as seen from the rest frame of R, remains unchanged, then 
the marks made by A and B will fall on C and D, respectively. Otherwise, 
only one set of markers, say, A and B of the rod L, will be able to make 
marks on the body of the other rod R, implying thereby that the rod L 
has become shorter due to its motion relative to R. An observer in the rest 
frame of R would then conclude that the length of the rod L has shrunk 
due to its transverse motion. 

An observer in the rest frame of L can also examine the marks. There 
cannot be disagreement between the two observers about the fact that these 


GC 
A Cc A 
sn 
R IU 
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Fig. 2.9. Measurement of transverse length. 
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marks lie within the body of R. Therefore, this observer would conclude that 
the length of the rod R (which is in motion relative to L) has expanded due 
to its transverse motion. 

Thus two different observers observe two different effects of speed on 
the transverse dimension of a rod (i.e. expansion in one case and contrac- 
tion in the other). This contradicts postulate 1. Hence, the conclusion that 
transverse dimensions cannot change due to motion. 

We shall therefore record the above finding in the following rule. 


Rule 4. When a rod moves transversely, as seen from a frame S, its length 
L as measured in S' equals its proper length Lo. 


We shall supplement rules 3 and 4 with two important corollaries. We 
illustrate them with the help of Fig. 2.10. It shows an object A of arbitrary 
shape. So is the rest frame of the object (we assume that every part of A 
is at rest in So) 


Corollary 4A. Let the rest frame So be moving with velocity v along the 
X-axis with respect to another frame S. Let the dimensions of the object 
along the X-, Y- and Z-directions be 6x0, d6Y0,620 in So and bx, dy, dz in 
S. Then 


1 
du =—dx9; dy=dyo; bz = 620. (2.28) 
v 
A y 

bx 

a S ox 

Vv 
oe dy 
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Z Zz 


Fig. 2.10. Volume transformation. 
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Corollary 4B. If )Vo is the proper volume of the object (i.e. the volume 
measured in So), then its volume, as measured in S is 


i= 2 aV. (2.29) 
7 


With the foresight gained through the discussions presented in this sec- 
tion, we shall now search for the correct transformation equation connecting 
the coordinates of a given event in two different inertial frames of reference. 
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Chapter 3 


Lorentz Transformation 


3.1. Lorentz Transformation I: Special Case 


We shall obtain the transformation equations between the coordinates 
(x,y,2,t) and (a’,y’, 2’,t’) of an event as measured in two frames of ref- 
erence S' and S’ which are moving relative to each other. The underlying 
concepts are best exemplified by considering the simplest example, namely 
boost in the X-direction. 

We shall use the term boost to mean relative motion between two frames 
of reference. Figure 3.1 illustrates a boost c3 of S’ relative to S along the 
X-axis, which we shall write compactly as “boost: S\(c6,0,0)S’”. Note that 


boost:S (cB, 0,0)S’ = boost:$"(—c8, 0, 0)S. (3.1) 
That is, the above arrangement also means a boost —c@ of S relative to S’ 
along the X’-axis. A more general boost will be 
boost:S(cBz, cBy, c8z)S" = boost:S’(—cBz, —cBy, —cBz)S. (3.2) 
For all such boosts, we have the following assumptions: 


#1 The coordinate axes XYZ of S are parallel to their counterparts 
X'Y'Z' of S’. 

#2 The time origins of both frames are chosen to be the instant (i.e. the 
clocks in both frames are set to zero hour at the instant) when the 
origins O and O’ of the space frames cross each other. 

#3 For the special case of boost:S(c@, 0,0)”, we further assume that S’ is 
moving in the direction of the X-axis with velocity v = cf relative to 
S, and that the X’-axis lies along the X-axis. 


We shall call this configuration the standard configuration. 
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Z Z 


Fig. 3.1. Standard configuration of frames S' and S’. 


An event “©” viewed from these two frames of reference is also shown 
in Fig. 3.1. The transformation equation that we shall obtain for this par- 
ticular case is a simplest special case of a general class of relativistic trans- 
formation equations, called Lorentz transformation, which we shall often 
abbreviate as LT. 

An anomaly exists among the coordinates x, y, z,t in the sense that the 
last coordinate has a different dimension than the first three. The usual 
convention in relativity is, therefore, to adopt ct for the time coordinate. 
This sounds reasonable because of the frame independence of the speed 
of light. From now on, the coordinates of every event will be written as 
(ct, x,y,z), with the time coordinate preceding the space ones, and each one 
of the coordinates having the dimension of length. In conformity with this 
practice, we shall measure all time intervals in the unit of ct. We shall, for 
example, say that a certain event has occurred at the instant ct, or that 
the time interval between two events “a” and “G” is cot. 

Let us therefore think of an event “O” whose coordinates are (ct, x, y, Z) 
in S and (ct’, 2’, y’, 2’) in S’. We shall find a relationship between the two 
sets of coordinates. For convenience of picturization imagine, the event 
is a lightning which strikes along the X- and X’-axes. We think of these 
axes as infinitely long straight rods sliding longitudinally along each other 
(Fig. 3.2). The lightning “O” leaves permanent marks Q’ on the X’-rod 
and Q on the X-rod. Since y and z coordinates of this event are identically 
zero in both S$ and $”’, we shall ignore them for the time being and write 
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Fig. 3.2. Lightning © as seen from S and S$’. 


the event “O” as 


“Q” = “Q! passes Q” = aa) (3.3) 
(ct',2') in S’, 


where « = OQ = proper length of the intercept OQ, and 2’ = O’Q’ = 
proper length of the intercept O'Q’. 

Figure 3.2(a) shows a view of the event as seen from the frame S. We 
have represented the viewing frame with continuous lines and the viewed 
frame with broken lines. The origin O’ of S’ is seen to coincide with the 
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mark P on the X-rod at the instant when “O” occurs. By this we mean 
that the event “W” = “O’ passes P” is simultaneous with “©” in the frame 
S, i.e. they have the same time coordinate ct in S. From assumption # 38, 
OP = £ct. According to (3.3), the “moving segment” O’Q’ (moving to 
the right) and the “stationary segment” OQ have proper lengths x’ and 2, 
respectively. The proper length of OP = fct. Therefore, the proper length 
of the segment is given by PQ = OQ — OP = 2 — fect. Also, seen from S, 
the “moving segment” OQ’ coincides with the “stationary segment” PQ. 
Hence, from Eq. (2.27), 


y (3.4) 


Let us now view the same lightning, i.e. the event “O” from the frame 
S’, represented in Fig. 3.2(b). The origin O of S, while moving to the left, is 
seen to coincide with some mark M’ on the negative X/-axis at the instant 
when “O” occurs. By this we mean that the event “®” = “O passes WM’ 
is simultaneous with “©” in the frame 5S’, i.e. they have the same time 
coordinate ct’ in $’. By assumption #3, OM’ = ct’. According to (3.3), 
the “stationary segment” O’Q’ and the “moving segment” OQ (moving to 
the left) have proper lengths x’ and x, respectively. The “moving segment” 
OQ and the “stationary segment” M’Q’ have, therefore, proper lengths x 
and (ct! +2’, respectively, and they coincide when seen from S. Therefore, 
by using Eq. (2.27) again, 

Bet’ +a2/ = 


R218 


(3.5) 
Hence x= 7(Gct' +2’). 


Eliminating «’ in Eq. (3.5) with the help of Eq. (3.4) we get 


a = 7°(a — Bct) + yBet'. 
2 (3.6) 


Yt 5 = (ct — Be), 


Or ct! =yct— 


where we have made use of Eq. (2.22). 
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Alternatively, one can eliminate x in Eq. (3.4) with the help of Eq. (3.5), 
leading to 


g' = 77 (Bet! + 2’) — yBet. 
Pal / / / (3.7) 
ap x’ = y(ct' + Ba’). 


Equations (3.6) and (3.4) represent the equations of transformation from 
(ct, x) to (ct’, 2’). Equations (3.7) and (3.5) constitute the inverse transfor- 
mation, i.e. from (ct’, x’) to (ct, x). 


y 


Or ct=yct! + 


The above equations are supplemented by the equations of transforma- 
tion for the y and z coordinates corresponding to a more general event “®” 
having arbitrary non-zero values for all the four coordinates. Therefore, let 
a lightning “®” strike an arbitrary point T, which for the convenience is 
identified as the top of a tower. We assign to this event the coordinates 
(ct, x,y,z) in S and (ct’,a’,y’, 2’) in S’. 

We have shown the event, as viewed from S, in Fig. 3.3. (Note that 
neither of the S and S’ frames is a rest frame of the tower.) Assuming 
that the base of the tower is on the X Z-plane, the height of the tower 
should be y in S and y’ in S’. According to Rule 4, y = y’. By a sim- 
ilar argument, z = z’. Let the tower top T be projected to the mark Q 
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Fig. 3.3. Lightning striking a tower. 
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on the X-rod and Q’ on the X'-rod, as shown in Fig. 3.3. Then the event 
“Q” = “Q passes Q’” is the same event as represented in Eq. (3.3) whose 
coordinate transformation we have just worked out. Summarizing all that 
we have so far accomplished, we can write the transformation equations of 
the coordinates for any arbitrary event “®” as follows: 


(i) Transformation equation from S to $”: 


The transformation equations (3.8) and (3.9) are the simplest exam- 
ples of Lorentz transformation. We shall refer to them as the standard 
Lorentz transformation corresponding to the standard configuration shown 
in Fig. 3.2. 

Two inertial frames of reference S and S$’ that are connected to each 
other by a Lorentz transformation — as in (3.8), (3.9), or more generally 
as in (3.18), (3.19) — have an alternative name: Lorentz frames. 


3.2. Lorentz Transformation II: General Case 


As we found out in Sec. 3.4, one reason why GT is not acceptable within 
the scheme of special relativity is that it yields different speeds of light in 
different inertial frames, whereas Corollary 2A in Chapter 2 requires frame 
independence of the speed of light. It is now incumbent on any relativistic 
transformation equation to meet this requirement of Corollary 2A. 
Consider an event “E”, namely “emission of a flash of light”, which takes 
place at the origin of S at ct = 0. Consider another frame S’ obtained 
from S by a more general boost, as defined at the beginning of Sec. 3.7. 
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By assumption (b), the event takes place also at the origin of S’ and at the 
time ct/ = 0 (as measured by S’.) Therefore, the coordinates of “E” are: 


bee (0,0,0,0) in S 
' (0,0,0,0) in S’{- 


Let “R” represent a subsequent event namely “reception of the flash of light 
by an observer”. Let its coordinates be 


Therefore, Corollary 2A in Chapter 2 requires that the following two equa- 
tions be both satisfied [refer to Eqs. (2.14) and (2.16)]: 
et? — (a? +y? +27) =0, 
(3.10) 
ct? — (a +? + 2) =0. 
Thus, we define Lorentz transformation to be a linear transformation 


between the Cartesian coordinates (ct’, x’, y’, 2’) and (ct, x, y, z) of any event 
“P” such that 


Cr = (a? a y? oe z*) = Ct? = (i he y’” fe 2), (3.11) 


(Properly speaking, what we have defined is a homogeneous LT which 
corresponds to the fact that the clock in either frame is set to zero when 
the origins O and O’ pass each other.) The adjective “linear” used in the 
above definition implies that each one of the coordinates (ct’, x’, y’, 2’) is 
a linear function of the coordinates (ct,x,y,z) and vice versa. This is the 
same thing as saying that there exists a matrix Q, with constant elements 
Qi; such that 


cr. Qoo Qor Qo2 Qos ct 

= _ Qio Oi Qig Qs v (3.12) 
y Q29 O21 O22 N23 y 

z! Q30 O31 O32 033 


We shall call this 2 matrix the Lorentz transformation matrix. Here note 
that the rows and columns of the above matrix have been indexed by the 
numbers 0,1,2,3. This is in conformity with the indexing convention we 
shall adopt in Chapter 7. By the way of illustration, we shall retrieve the 
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LT matrix associated with the boost cG in the X-direction, out of Eq. (3.8), 
which can be rewritten in the following matrix form: 


ct! Y -76 0 0 ct 
/ _ 0 O x 
2s a (a a (3.13) 
y! 0 0 10 y 
z! 0 Oo oO 1 


Hence, the Lorentz transformation matrix associated with a boost c@ in 
the X-direction is as follows: 


y =e O10 

. 2 0 0 

@.|P 7 (3.14) 
0 oO 10 
0 oOo 01 


We shall now prove that the Lorentz transformation represented by 
Eq. (3.8), which is also equivalent to Eq. (3.13), satisfies the requirement 
of Eq. (3.11). 


Proof of Eq. (3.11). 
24? — (a! + yl + 2) = {y(ct — Ba)}? — {y(e— Bet)}? — y?— 2? 
= 72(1 — Be? — 92(1— B%)u? — y? — 2? 
= ct? — (g7 +4? + 27). (QED) 


Let us now take up the case of a general boost {.5(3)S’}, as illustrated 
in Fig. 3.4. In this case, the boost velocity is G, in any arbitrary direction n. 
That is, the velocity of the origin O’ of the frame S$’ is 8 = 8n with respect 
to the origin O of the frame S. Here n is a unit vector in the direction of G. 

We have set up a new set of coordinate axes (€,¢,7) to replace (x, y, z) 
in S, and (€',¢’,7') to replace (a’, y’, 2’) in S’, such that (i) the axes € of S 
and €’ of S" coincide and are oriented in the direction of @, (ii) the origins 
O and O’ coincide at t = t! = 0. We shall adjust the simple LT given in 
(3.8) to the general LT pertaining to this general case. 

Let r and r’ represent the radius vectors from O and O’, respectively, to 
the location of the event ©, as measured in S' and S$’, respectively. We shall 
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Fig. 3.4. A general boost. 


resolve these vectors parallel to and perpendicular to the direction n: 
r=fe.+¢e.+ne, =€n+r, =r, +7r., (3.15) 
where 


r, =€n=€e,, the component of r parallel to GB, 


r, =Cec+ne,, the component of r perpendicular to (. 
Now, we shall obtain the analogues of Eq. (3.8): 


(3.8a) = ct! = y(ct — BE) > ct! = 7(ct- B-r), 
(3.8b) = € = 7(€ — Bet) > vr, = fn =4(€ — Bet)n (3.16) 
= = 7(r| — Bct), 
(3.80) and (3.8d) > (C47) = (G9) > r', =Tis 
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From the last two lines, 


r=rt+r, =(9r, +11) — Bet 
= (ry +r) + (y- 1)r) — Bet 
1)r, — yGet 
1)(r-n)n — yfct. (3.17) 


Noting that n = 5, we can summarize as follows. The Lorentz transforma- 
tion corresponding to the general boost {S(3)5’} has the following form: 


ct’ = y(ct — Br) (3.18a) 
= ct + [(7 —1)ct — 78-4]; (3.18b) 
r= (qr, +r) — yGct (3.18c) 
a4 1 (8-1) — Bet (3.18d) 
=r+([(7—-1)(r-n) — 76ct)n. (3.18e) 


We have written each transformation in two or three different forms, so 
that the reader will have several choices for different applications. 

The inverse of the above transformation will correspond to the boost 
{5”(—)S}, and can be obtained from (3.18) by exchanging (ct,r) with 
(ct’,r’) and replacing B with —: 


ct = ct! + [(y — lect’ + 78-1’), (3.19a) 


y-1 
We have used the coordinates (€,¢,7) and (&’, ¢’, 7’) as a tool for obtain- 
ing the relation (3.18). Now, we discard them, and interpret r to mean 
(x,y,z), x’ to mean (2’,y’, 2’). Moreover, 3 is the boost velocity with com- 
ponents (8z, By, 8z), and @-r stands for xB; + yBy + zBz. 
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The Lorentz transformation matrix corresponding to Eq. (3.18) is as 
follows: 


i" —7Bx —7By —Bz 
ape tiem 2a. 2s 
F VPa Be x Be yPa Be zPex 
Q= y—-1 y-1,, y-1 . (3.20) 
—7By “pe Peby 1 “ar Py “ge BaP y 
= = = 
—yBz “Fr abs 168. 1 + ae : . 


To see that “the boost in the X-direction” is a special case of Eqs. (3.18a)— 
(3.18e), one needs to set 6 = (6,0,0) in these equations to get back equa- 
tions in (3.8). 

All relativistic formulas must reduce to the corresponding non- 
relativistic forms in the limits of small velocities. This is a general require- 
ment which we shall use from time to time as one of the checks whenever 
any new result will be derived. In the present case, the non-relativistic limit 
implies y ~ 1. Using this approximation in Eq. (3.18), it is easy to see that 


ct’ = ct, 
(3.21) 
r =r—cftt. 


These are identical to the homogeneous form (2.2), i.e. corresponding to 
ro = 0 if we recognize that u = cf. 

It will be useful to note at this point that the requirement of Eq. (3.11) 
is satisfied also by a pure rotation, i.e. the transformation in which the 
(a2’,y’, 2’) coordinates are obtained from (a,y,z) (or vice versa) by a 
rotation of the axes XYZ to X'Y'Z’, thereby leaving ct! = ct. Such a 
transformation of (x,y,z) > (a’,y’,2’) falls under the general class of 
three-dimensional orthogonal transformation (see also Sec. 7.5). In fact, 
the definition (3.11) is also satisfied by a pure space reflection, i-e., 
vo = —2,y' = —-y,2' = —z,ct! = ct and a pure time reversal, i.e. 
vo =a2,y = y,2' = 2',ct’ = —ct. What we shall call Lorentz transfor- 
mation in this book will include (a) pure boost, (b) pure rotation and 
(c) a combination of boost and rotation, but will exclude space reflection 
and time reversal. Such a transformation is normally called proper Lorentz 
transformation. It will be a useful exercise for the reader to show that the 
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effect of two successive boosts is in general equal to one boost and one 
rotation, so that boost and rotation are in fact inseparable.* 


3.3. Simple Applications of Lorentz Transformation 


In order to illustrate the meaning of LT, we shall use Eqs. (3.8) and (3.9) 
to retrieve the time dilation and length contraction formulas derived in 
Secs. 2.6 and 2.7. 

Consider two events “A” and “B” having coordinates 


SAR? —_ 
“BRB” _ ia 


We can obtain the transformations of the coordinates of “B” and 
“A” and take the difference to get the transformation of the coordinate 
differences: 


ct, £,Y, 2) in 9, 
ct’,a',y’,2’) in S’, 
ct 4 ,o + Aw, y + Ay, z+ Az) in S, 


( 
( 
( 
( 


ct! + cAt’, 2’ + Ag’, y’ + Ay’, 2’ + Az’) in S$". 


cAt! = 7(cAt — BAc), 
Aa’ = y(Aa — BcAt), 


(3.22a) 
Ay’ = Ay, 
Az’ = Az, 
cAt = y(cAt’ + BAz’), 
Ag = 7(Aqg’ + BcAt’), 
aoe cae ee (3.22b) 
Ay = Ay’, 
Az = Az’. 


These equations mean that the coordinate differences follow the same 
Lorentz transformation as the coordinates themselves. 

Now consider two events “A” and “B” which occur at the same spatial 
location (2, y’, 2’), but at two different times ct’ and ct’+cAt’, as seen from 
the frame S’. In that case Az’ = 0, so that At’ is the proper time between 
these events. The corresponding time interval At measured in the frame S 
is the “improper” time, and is obtained from the first one of transformation 


aSee Ref. [34, Sec. 2.8]. 


Lorentz Transformation 59 


equations given in (3.22b), 
At = yAt! = yAr. (3.23) 


We recall Rule 2 given in Eq. (2.23). Note that we have used the symbol 
Ar to mean proper time between the two events. 

Let us elucidate with a commonplace example. A train arrives at Bhopal 
at 10 hours, and at Nagpur at 18 hours. These two arrivals are examples of 
the events “A” and “B” considered here. They occur at the same spatial 
location (2’, y’, z’) with reference to the frame $” of the train but at different 
spatial locations with reference to the frame S of a station master which 
is fixed on the ground. The time interval At! = Av = 6hours as noted 
by a passenger of the train is the proper time between reaching Bhopal 
and reaching Nagpur. The corresponding time At that is recorded by the 
station master is in this case “improper time” and will be more than 6 hours. 

Let us now consider the length measurement of a moving rod AB using 
shadow-graph in a laboratory S, as illustrated in Fig. 2.8. Two flashes of 
light emitted simultaneously by the flash guns G; and G2 graze past the 
endpoints A and B of the speeding rod and etches marks P,, P2 on the 
experiment table, as illustrated in Fig. 3.5. 

Two events are involved in this measurement: “A” = “A coincides with 
P,” and “B” = “B coincides with P,”. The flash guns are riveted to the 
experiment table in the Lab frame S and the points P,, P: are fixed points 
on this table. The distance between them is Ax = L in S. Also they take 
simultaneously in S (but not in $’), so that At = 0. 

The rest frame S’ of the rod is moving with velocity 8c with respect 
to S. The endpoints A and B of the rod are the locations of the events 
“A” and “B” as seen from S’. The distance Ax’ = Lo between these events 


Fig. 3.5. Length measurement using LT. 
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is the “proper length” of the rod, which is now determined from the second 
one of transformation equations given in (3.22a). 


Aa’ =yAz, or Lp =74L. (3.24) 


We recall back the length contraction formula (2.27). 


3.4. Time-Like, Light-Like, Space-Like Intervals 


The coordinate difference between two events satisfies the Lorentz trans- 

formation according to Eq. (3.22). Hence, the property (3.11) should also 

be shared by the coordinate difference, i.e., 

del As? 
(3.25) 


At? — (Aa? + Ay? + Az”) = At? — (Aa + Ay”? + Az”) 


We have denoted this invariant quantity by the symbol As?. The quan- 
tity As as written above will be called the square of the interval, or (in order 
to be brief) just the interval, between the two events. This interval will be 
called 


e time-like, if As? > 0; 
e space-like, if As? <0; 
e light-like, if As? = 0. 


The significance of these strange names can be explained as follows. 


e If the interval is time-like, then we can find a frame of reference S$’, 
moving with a boost velocity ¢3; 6 < 1, with respect to S, such that the 
events occur at the same spatial location, say the origin, but at different 
instants of time, say, t4 and tg =t4 + Ar, where Az is the proper time 
between the events. In this case, Av’ = Ay’ = Az’ = 0 and cAt! = cAr. 
In other words, the coordinate difference has only time component, but 
no space component. Hence, the name time-like. 

Note that, for a time-like separation between two events, Ar = As. 

e If the interval is space-like, then we can find a frame of reference S$’, 
moving with a boost velocity ¢3; 6 <1, with respect to S, such that the 
events occur at the same time, say at t! = 0. That is, they are simultane- 
ous, but occur at different spatial locations, say, x’, and z/, = 2’, + Al. 
In other words, the coordinate difference has only space component, but 
no time component. Hence, the name space-like. 

Note that, for a space-like separation between two events, there is no 
proper time, because As is imaginary. 
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e If the interval is light-like, then there is no frame of reference in which 
the events can occur at the same spatial location. In other words, the 
two events cannot be connected by any frame of reference. Only a light 
signal, e.g. “6,” = a radio message is sent; “62” = the same message is 
received, can connect the events. In this case, the proper time Ar = 0. 


3.5. Relativistic Doppler Formula 


As another interesting application of LT, we shall obtain a relativistic for- 
mula for the Doppler effect. Figure 3.6 shows a transmission tower of height 
h moving in the X-direction with velocity cG. At the top of the tower is the 
transmitter K. Fixed on the ground is the receiving station O. Moreover, 
S and S’ are, respectively, the rest frames of the receiving station and the 
transmitter. Consider the following events: 


e “Ox,” = “the transmitter transmits a sharp beep signal”; 
e “®,” = “the receiver receives that signal”. 


We write event coordinates as (ct, x,y), leaving out the z coordinate. 
Noting that « = ct, we write 


vo.» _ { (ct 0h) ins’, 
_ (ct, Bct,y) in S. 


Niven 
= 


AMA 


\ 


Fig. 3.6. Explaining Doppler effect. 
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Applying the Lorentz transformation equation (3.9) to the time coordi- 
nate and to the y coordinate, we get 


ad=7d: =k (3.26) 


Since the reception takes place at the origin of S, we can write its 
coordinates as “®o” = (cT,0,0) in S, where T is the time when the signal 
is received. The signal has propagated from K to O, a distance of Vx? + h? 
in time (J —t). Therefore 


(Sct)? +h? = c?(T -t)’. (3.27) 

Using Eq. (3.26), we get 
42 B2et? +h? = 2(T — yt"). 
Simplifying, 
2 2 h? 
/ Pe 

T*+¢* —2yTt = @ (3.28) 
which is the relation between the transmission time t’ and the reception 
time T, each measured at its respective station. 

Now, let there be a continuous beam of a sinusoidal radio wave trans- 
mitted from the moving tower, whose frequency is measured to be fo at 
the transmission station and f at the receiving station. The wave consists 
of a series of crests separated by a time interval which equals dt’ = 7 
and dT = + as measured at the transmitting and at the receiving sta- 
tion, respectively. Therefore, by differentiating (3.28) we get the required 
relation between fo and f: 

ar yr! 


aa (3.29) 


It follows from Eqs. (3.26) and (3.27) that T— yt) =T-t=,/=" =4, 
where J= length of the hypotenuse OK. Also 


t 1 
la ae sr 


co . t) = (P — 8) + 64] = 20+ Be). 


dT 


Ta y(1 + Bcos8), (3.30) 
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where @ is the angle of elevation of the transmitter when viewed from O. 
In other words 


Fe fo 
~ (1+ cos dé)’ 


Equation (3.31) represents the relativistic Doppler formula. The angle 6 is 
to be interpreted as the angle between the direction of motion of the source 
of light and the ray line. The reader must have realized that this formula 
is valid only for light transmitted by a moving source. 

We shall write l = - and specialize the formula for three special cases: 


(3.31) 


Lp 
6 = 0 longitudinal; source receding T= ¢/-—<$S, 
P . 1+B 
T ; -~ 1 
d= 3 transverse; source vertically up T = —-, (3.32) 
Y 
= 1 
6 = 7 longitudinal; source approaching T = — 
For non-relativistic Doppler effect, we set y = 1, and get 
fo 
= —_ 3.33 
I (1+ Gcosé@)’ Ae) 


which is valid when the transmission tower (or the source of the wave) is 
moving with speed v < c. This formula is valid for sound as well if we 
substitute for c the speed of sound [11]. 

The simple look of the Doppler formula (3.31) is misleading. In actual 
application, it can present conceptual and mathematical challenges. We 
have presented two examples for a better appreciation of the formula. One 
of them is a detailed workout in Sec. 8.9.5. The other one is Exercise R3 
for the reader, in Sec. 8.10. 


3.6. Worked Out Problems I 


Problem 3.1. Let R and R’ be two straight rods sliding past each other 
with relative speed c@ as shown in Fig. 3.7. Prove the following theorem 
as a corollary of Theorem 2.1. 


Theorem 3.1. Let the segment AB of R coincide with the segment A’B’ 
of R! when viewed from R (Fig. 3.7(a)), and with the segment A’B” of R' 
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View from R at ct=ct’=0 


ie. U Bw 
: ye 
(3) o Rd > 
\f 
AN 'B’ cB 
— ae 
a (a) 
0 ! 
View from R’ at ct=ct’=0 
) ; 0) 
‘ 4 (already happened) 
_ ¥ ie 
: e" : 
cB AY R ,B” B 
=<=>__— af R - (7 
Ai B (b 
, 
\3 
A l B 


Fig. 3.7. Problem 3.1. 


when viewed from R! (Fig. 3.7(b)). If & and l" be the proper lengths of 
A’B’ and A'B”, respectively, then 


1 
f= a (3.34) 


Solution to Problem 3.1 


The proper length A’B’ shrinks to proper length AB when seen from R. 
The proper length AB shrinks to proper length A’B” when seen from R’. 
Let £ be the proper length of the segment AB. 

Hence, by Theorem 2.1, 


_~ A’B’ t 
B= , or, €=-, 

Y Y 

wera 8B gn, wat, 
os y 
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Therefore, 


Problem 3.2. Prove the following theorem as a corollary of Theorem 3.1. 


Theorem 3.2. Consider two events “0” and “dé” occurring at the locations 
(A, B) on R and (A’, B’) on R’. Let these events be simultaneous when 
viewed from R (so that ctg = cts). Then, the time interval between these 
events, when viewed from R’, is given by 


c(tg — ty) = BE, (3.35) 
where ¢' is the proper length of the segment A‘B’. 


Solution to Problem 3.2 


(a) Solution using length contraction formula 
In the following, we shall write (A : A’) to mean the event: A passes A’ = A’ 
passes A. 

Consider the following three events 6 = (A: A’), ¢ = (B:B’), vw = 
(Be BR"), 

Seen from R, @ and ¢ are simultaneous, so that ctg = ctg. 

Seen from R’, 6 and w are simultaneous, so that ct, = ctiy. 

Seen from R’, as the rod R moves to the left, the event ¢ precedes the 
event w. 

According to Theorem 3.1, and Fig. 3.7, the proper length of B”B’ 
equals f’ — 0" = (1— 4)0! = Be. 

The mark B on the rod R, moving to the left with velocity c@, covers up 
the proper distance B’B’ in time At = oe a ee. and the event w occurs 
after the event ¢, so that t), > ty. 

Therefore with respect to the frame R’ 


c(tg — ty) = c(t, — t4) = BY’. 


(b) Solution using Lorentz transformation 

We have pictured the standard configuration in Fig. 3.8. The “moving” rod 
R’ is stationary in the frame S’, and the rod R is stationary in the frame 
S. The events @ and ¢ are simultaneous in S. Their coordinates (ct, 2) in 
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) 0) 

A’ iB’ x’ 
t R’ -_ > cB xX 
—, t 


Fig. 3.8. Problem 2.2. 


S and S’ are written as follows: 


0 = (cto, x9) = (0, 0), @ = (ctg, xg) _ (0, £) in S 


(3.36a) 

6 = (ct, 4) = (0,0), b= (cty,a,) in S’ — (3.36b) 

Apply LT: ct’, = (cts — Bag) = (0 — Be) = — 788. (3.36c) 
x's = (xg — Bets) = 7(€— 8 x 0) = 2. (3.36d) 


Note that L's = ¢’, i.e. the proper length between A’ and B’. Therefore, 


from Eq. (3.36d): ’ = 2; 
from Eqs. (3.36b) and (3.36c): c(t, — t,) = 0 — (—yBe) = 7Bl = Be’. 
(3.37) 


Problem 3.3. Prove the following theorem. 


Theorem 3.3. Let “0”, “dé” and “w” be three events, such that “0” and 
“®” occur at the same spatial location in S, “8” and “w” at the same spatial 
location in S'. If the time intervals, as measured in S", are T{ between “6” 
and “”, and T; between “6” and “wW”, and if “d” and “wW” are simultaneous 
in S, then 


_f 


. (3.38) 
‘ 


T} 
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Compare Eq. (3.38) with Eq. (3.34). (See Example 3.5 for illustration of 
this theorem.) 


Solution to Problem 3.3 

(a) Solution using time-dilation formula 

In Fig. 3.9, we have explained the configurations of the frames S and S’ 
at the three events (0, ¢, w), which we identify with three lightning strokes 
striking at different times as shown in the figure. We take the timing of the 
event 0 to be Top = Tj = 0, that of the event ¢ to be T;, T] and of the event 


Xe. 1X 
(s] 
(b) 
proper 
time 
X x? 
Y 
(s 
cB 
a) (c) 
t proper 
time 
= 7, 


Fig. 3.9. Problem 2.3. 
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w to be To, T4, with respect to S and S$’, respectively. Then 


T, = proper time between @ and ¢, so that T, = T//7 
T3 = proper time between 0 and w, so that TS = To/y 
Also, T, = T>2 by assumption (3.39) 
Hence, TY/y = 71 
Or, Tat fae, 
(b) Solution using LT 
6 = (0.0); @ = (cT1,0); YW =(cT2,273) in S 
6 = (0.0); d= (el" 1,24) w = (cTS,0) in S’ 
Apply LT: eT] = y(cTi — 8 x 0) = eT) (3.40) 
cT2 (cT3 + B x 0) = ycT3 


Apply inverse LT: =7 


By assumption: Tg=T,. Nenee, ef = yelp = 4" cTh. 


Problem 3.4. (a) Rewrite Eq. (3.12) in the form: 


= 0X, (3.41) 
where 
ct’ ct 
x! . a * 
= eats = 
¥y y 
2 Zz 


are column matrices. Let X' = (ct’,2’,y', 2’), X = (ct, x,y,z) be transposes 
of X’ and X, respectively, and let 


0 

: C =1 6 

G= (3.42) 
fe «1 0 
G @ ai 


(This G is identical with the metric tensor to be discussed in Sec. 7.9.3.) 
Show that Eq. (3.11) can be expressed compactly as follows: 


X'GX = XOX, (3.43) 
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(b) Hence, establish the following important property of any LT matrix: 
OT GO = G, (3.44) 
where Q7 represents the transpose of (2. 


Solution to Problem 3.4 


(a) 


1 0 O 0 ct! 
0 -1 0 O nid 
Lh.s. = (ct’, 2’, y’, 2’ 
( oe. Por / 
0 0 -1 zs 


= ct? — g!? — yl? — 2. 


2 42 2 2 


Similarly, the r.h.s = ct? — 2? — y? — z?. 


By assumption |.h.s = r.h.s. Hence the identity. 


b ‘ 
(b) ; 
XIGX! = (KATA) = KTEX =KEX. (3.45) 
Hence, QT GO = G. 


Problem 3.5. Let “0,”, “02” be simultaneous events in S occurring along 
the X-axis. Show that they remain simultaneous events in S’ under the 
boost: $(0,¢G,0)S’, or more generally under the boost: $(0,c8y,c.)S". 


Solution to Problem 3.5 
Refer to the Lorentz transformation (3.18) for the general case. Note how 
time transforms: 


ct! = y(ct — B-r). (3.46) 


Suppose the first event occurs at the common origin of the frames S and S’ 
when the time is set to zero in both frames. For the second event 6-r = 0 
according to our assumption. Hence, ct’ = 0. 


Problem 3.6. As shown in Eq. (3.22) the coordinate differences between 
the events “6,”, “@2” undergo the same LT as the coordinates. We shall 
rewrite the same equation more clearly as follows. 
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c(t — t) = y[e(te — t1) — B(w2 — 21)], 


) 

(a5 — 2) = y[(2 — 21) — Be(te — t1)), 
) 
) 


Using the above transformation equations, prove the following statements: 


(a) There exist frames A and B having inverse time relations for the events 
“6,” and “9” (i.e. if “0,” happens before “62” in A, then “0,” happens 
after “O02” in B), if and only if there exists a frame S in which these 
events are simultaneous. [In fact there exists an infinity of such frames. 
See Example 3.11.] 

(b) The temporal sequence of the events “6,” and “62” is the same in all 
frames (i.e. if “8;” occurs before “O2” in some frame A, then the same 
must be true in all frames) if and only if there exists a frame S in which 
these events occur at the same spatial location, in which case the least 
time interval between the events is the (proper) time measured in S. 


Solution to Problem 3.6 
(a) Set c(t2 — t1) =0 in Eq. (3.50a). Get 


c(t, — t)) = —yB(x2 — 21). 


a . (3.48) 
Therefore, t —t; <0 for 8 positive/negative. 
(b) Set (a2 — 21) = 0 in Eq. (3.50a). Get 
c(t, — th) = ye(te — t1). 
(ts — t1) = ye(te — tr) (3.49) 


Therefore, t4 — t, > 0 for any value of y, since y > 1. 


3.7. Illustrative Numerical Examples I 


Take c = 3 x 10° m/sec for numerical exercises in this chapter. We have 
used “light-second” (abbreviated as lt-sec) as an alternative unit of length. 
1 It-sec = 3 x 108m. While working out the exercises, the reader may find 
it convenient to convert “lt-sec” unit to “meter” unit by multiplying with 
c. For example, 5.2 lt-sec = 5.2cm. 

Examples 3.1-3.6 allude to an imaginary superfast (and superlong) 
space-train called Cosmos Express shown in Fig. 3.10. It is coasting with 
a uniform speed 8 = 3. Its (proper) length is £ = 6 x 10° m (ie. 2 It-sec). 
Along its straight line route lie two space stations named Andromeda and 
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D=10 It-sec £ =2 It-sec B=4/5 d=1.2 It-sec 


Cosmos Express 


£ 


Fig. 3.10. Cosmos Express running between S and V. 


Vega (no relative velocity between them) manned by the respective station 
masters Amar and Vivek, to be referred as Mr A and Mr V. Their offices 
(shown by the letters A and V) are D = 3 x 10°m (i.e. 10 lt-sec) apart. 
Peter, the passenger (to be referred to as Mr P), is travelling in the centre 
coach. The intervening terrain between A and V being treacherous (occa- 
sional attacks by space pirates), Peter’s mother prays for his safe passage, 
as she sits quietly at A. Clocks in the space station frame S and the train’s 
frame S’ are set to zero, ie. t = t/ = 0, when “P passes A”. At the same 
instant “mother starts praying”. 

In the following questions, all primed quantities (e.g. T’, D’) refer to 
measurements in 5”, and the unprimed quantities to those in S. All mes- 
sages between individuals or stations are radio messages (so that they 
are transmitted with the speed of light). Answer the questions asked in 
Examples 3.1-3.5 by applying Rules 1, 2 and 3 (i.e. without applying LT). 


Example 3.1. “Mother finishes praying” at t = T,, when “P passes V” 
(i.e. when Peter has reached Vega), according to clocks in S. At the same 
instant “she sends her good wishes” to Peter. “Peter receives her message” 
at t= Ts, ti = Ts. Obtain T and T3. 


Solution. Let us first identify the events involved, and their coordinates 
(x, ct) in S and (a’, ct’) in S’. We use the symbol to mean that we are 
not interested in this particular coordinate: 

Qo = “P passes A” (x =0,ct = 0); (2’ =0,ct’ = 0); 

©, = “Mother finishes prayer” (x = 0, ct = cT\); (2’ = t,ct’ =cT}); 

Os = “P passes V" (a = Di ae = 13); (a = 0,c0 = cls); 


Qs = “P receives message” (x = t,ct = cT3); (x' =}, ct’ = cT5). 


72 Special Relativity, Tensors, and Energy Tensor 


Note that “Mother finishes praying” and “Mother sends message” are 
the same event ©,. In time 7}, the train has moved a distance D with speed 
Bc. Also, the events ©; and Q» are simultaneous events in S. Hence, 


D 10cm 
T, =1h= >= 7— = 12.5s. 
cB cm/s 
The message is travelling at speed c between 03 and © . It is sent when P 
has already moved a distance D. Therefore, the message travels an extra 
distance D over the train which is travelling at speed Bc. Hence, 


D 10cm 
c(T3 — T;) = D+ 8c(T3 —T,), > T3- Ty = @i—pe leaks = 50s. 
5 


Hence, 73 = T, + 50 = 62.5s. 


Note: None of the rules of relativity has been used in finding the above 
three answers. 


Example 3.2. Determine T}, using Rule 2. 


Solution. Note that T3 is the proper time between the events 03 and 
Oo (because Mr P is present at both events). We shall use Eq. (2.23) to 
relate T} and the “improper time” T3 between the same events which has 
been already found out in the last question. For this, we need the Lorentz 
factor ¥: 


y= ee = il T= 37.58. 


J1l- pinssts 3 | 


Example 3.3. From the time T3, Peter determines the time t’ = T/, when 
his “mother finished praying and sent the message”. Find out Tj. [Hint: 
Peter does not know relativity, but knows that his mother has been receding 
from him at the speed 8 = 4] 


1 1 5 ,. Ty 625 
5 
3 


Solution. In the time T], which is the time interval between Oo and 0, 
as measured in $’, the point A has moved with velocity —G@c (i.e. in the 
negative X’-direction) a certain distance a, i.e. from x’ = 0 to a! = —a. 
Therefore, 


4 
i= Gel, = raat 


In the subsequent time interval 73 —T{ (measured in S$’), P remains station- 
ary at x’ = 0 in his rest frame S’, while the message travels from a’ = —a 
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to x’ = 0 with speed c. Therefore, 


T! 37.5 
eT —T!) =a = BcT!. se : =~ = 20.838. 
( 3 1) B 1 al 1 eB 2 


Example 3.4. Find the time t/ = T} when “P passes V”. [Hint: The line 
segment AV has a contracted length.] 


Solution. At time t’ = T} the point A has moved in the negative X’ direction 
at a speed of Gc to cover a distance D’ which is the contracted length of 
the proper length D. Therefore, 
D D 10 
GR SD! aS Se ee. 
y 


Bey #cx3m/s 


Example 3.5. Summarize your answers obtained so far in the following 
tabular form: 


Duration of mother’s prayer 
according to S frame, T; =... 


according to S’ frame, Tj =... 
Duration of Peter’s journey from S to V 


according to S frame, Tj =... 


according to S’ frame, Tj =... 


Verify that Rule 2 is illustrated in both sets of answers, namely JT) = 
<Ty a =T. 


Solution. The proper and “improper” times between OQ 9 and ©, are Tj 

and Ty, respectively. So we expect that 4 = e Similarly, the proper and 
1 

“improper” times between Oo and Og are T5 and T2, respectively. So we 


expect that - a - 


' ™Y 3 #1 
T%=12.5s; T, = 20.838; == s=-. 
Ti 5 ¥ 


Ups ie. ee: 


/ 
ee 
T2 5 *y. 
Hence, our expectations are confirmed. 


Example 3.6. If the length of the space station V is 1.2 lt-sec, how long 
does the length of Cosmos Express take to traverse the length of V (a) 
according to S$? (b) according to S’? Answer by applying Rule 3. 
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Solution. (a) Let t be the required time in S. In this time, Cosmos Express 
travelling with speed Gc moves a distance equal to the length of the plat- 
form which is the proper length d plus the length of the train which is the 
“improper” length ¢’. Hence 


£  2em 


(= -=—— =12cm, (3.50a) 
sf 3 
Bet = U +d=1.2c+1.2c = 2.4cm, (3.50b) 
+d 24 2.4 
air a ae ie (3.50c) 


(b) Let t’ be the require time in S. In this time, platform V travelling with 
velocity —Gc moves a distance equal to the length of the train which is the 
proper length @ plus the length of the platform which is the “improper” 
length d’. Hence 


d 12 
df =5 =" =0.72cm, (3.51a) 
a 
Bet! =l+d' = 2c+0.72c = 2.72cm, (3.51b) 
Cid 972. 27 
= oF = =34s. 3.51c 


Example 3.7. Obtain the answer in (b) of the last question by applying 
LT. 


Solution. We have shown the two boundary points of the station V, the left 
boundary L and the right boundary R. The office V is located at midpoint 
between L and R. In order to apply LT, we have to identify the two events 
involved, namely, 0; =“E passes L” and Or = “G passes R”. The space 
and time intervals between these two events, as measured in S are: Ax = d 
and cAt = — which is same as the time ¢ obtained in part (a) of the last 
problem. We shall obtain the values of these coordinates in S’ applying the 
transformation of coordinate differences as given in Eq. (3.22) 


ct! = (oat ~ Bar) = 7] “" — pal =o(S-s)a 


B B B 

1 d +d 

== yl + “| = : 
B | 3 Bp 

We have used Eqs. (2.22), (3.50a), (3.51a). We get the same answer as in 

(3.51c). 
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Example 3.8. Muons (denoted by the symbol ~) are charged particles 
like electrons, but about 207 times heavier. They are present abundantly in 
the earth’s upper atmosphere, being caused due to bombardment of nuclei 
by cosmic rays. A muon decays spontaneously into an electron and a pair 
of neutrinos, the decay mean-life being 7) = 2.3 x 10~° sec in muon’s rest 
frame. However, they live longer as they travel through the atmosphere 
with relativistic speed very close to c. Their prolonged life is one confirma- 
tion of the time dilation formula. 


A balloon is sent up to a height of 2km in the atmosphere to measure 
the flux density of muons of average speed 0.995c. If a detector carried 
on the balloon counts 10 muons per minute, how many muons per minute, 
on the average, will the same apparatus count at the sea level? Assume that 
the muons are travelling vertically downward and that no muon is stopped 
by the atmosphere. {Hint: The number of muons fall off exponentially as 
eo, where to is the mean-life and t is the time of flight of the muons, both 
measured in the experimenter’s frame.] 


Solution. Let No be the number of muons to be detected by the balloon at 
the reference point A which is located 2 km above sea level. Let N, be the 
same number at the sea level. The number of muons decreases with travel 
time t from the point A downward, and is given as N(t) = Noe7*s, where 
to is the mean life in Earth frame. The mean life 7) given in the problem is 
the “proper” lifetime of an average muon between the events “birth” and 
“death”. The two mean lives tg and 79 are related by the time dilation 


formula. 
1 
y= ——— = 10, 
1 — (0.995)? 
ip = = 10 X2.3% 10° = 23x 10-“s, 
2 10° 
t = travel time of u~ = ee = 0.67 x 107°, 


~ 0.995 x 3 x 108 
N;= Noe™ #0 =10xe 33 =7.47 per minute. 
Example 3.9. Two events “A” and “B”, when viewed from S, are sepa- 


rated by a distance d = 2 lt-sec and a time interval t as given below: 


(a) t = 6 sec; 
(b) t = 2sec; 
(c) t = 1sec. 
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(i) Is there a frame of reference S’, for each one of the above three cases, 
in which the events occur simultaneously? If so, determine the boost 
velocity c3 of S” with respect to S. 

(ii) Is there a frame of reference S’, for each one of the above three cases, in 
which the events occur at the same spatial location? If so, determine 
the boost velocity cG of S’ with respect to S and the proper time 
between the events. 


Solution. For this and subsequent solution to problems in this chapter, we 
shall write AY to mean coordinate difference between two points B and 
A. Actually, this quantity is a 4-vector, stretching from the event A to the 
event B. The meaning and significance will be explained in Sec. 7.8. 


(a) Let the coordinate difference between the two events be AY = B— A. 
Then 


( 
AY’ = (6c, 2c) 
As? = (6c)? — (2c)? = (4V2c)? 
Ar = proper time = 4V2 sec. 
The interval is time-like. Hence, we have the following conditions: 


(i) The events cannot occur simultaneously in any frame. 
(ii) Yes, there exists a frame 5S” in which the events occur at the same 
spatial location. Let Gc be the boost velocity. Then, 


Ag’ =0=7(Ax — BAct) 


3 Ax Ze. 1 
Act 6c _ 3’ 
1 9 3 
= —————— - => oo. 
A 1 8 24/2 
9 


There is a proper time Ar between the events. Let us find it. 


3 1 
Ar = 7(Act — 6 Ax) = —=[|6-=x2 = 4/2 c¢ lt-sec. 
cAr = 7(Act — B Az) = ( 5 xaje J/2c It-sec 
Hence, Ar = 4\/2sec. 

Ar 4/2 2/2 
At 6 3 
formula (2.23). 


1 
—, consistent with the time-dilation 
7 


Lorentz Transformation 77 


(b) AY = (2c, 2c); As? = 0. The interval is light-like. 
(i) No. (ii) No. 
(c) AY = (le, 2c); As? = —3c?. The interval is space-like. 


(i) Yes, there exists a frame S’ in which the events occur simultane- 
ously. Let us find the boost velocity for the frame 5S”. 


cAt!=0=7(Act — BAz) 


8 Act Cc a 
Ar Qc” 
1 2 


2 
Az! = a2 — 4 x lc = V3c = distance between A and B in S’. 


This is consistent with the length contraction formula (2.26): 


Ar 2 


Av V3. 


(ii) No. 


Example 3.10. Two lightnings strike, leaving marks A and B on a straight 
railway track simultaneously according to the Station Master (Mr. S). A 
traveller (Mr T) travelling in a long Einsteinian railway train moving with 
velocity 0.6c records the timings of the events using his instruments. Let 
the distance between A and B be L = 2.4 x 10° km, as measured by Mr S. 


(a) Did the events occur simultaneously according to the measurements of 
Mr T? If not, which occurred earlier? 

(b) What is the time interval between these events according to Mr T? 

(c) What is the length of the track segment AB according to Mr T? 

(d) The lightnings also left marks on the wheels A’, B’ of the train. What is 
the length L’ of the segment A’B’ (i) according to Mr T? (ii) according 
to Mr S? 


Solution. 


(a) No. 
(b) L=2.4x 10%km = 0.8 x 10-? It-sec. = £¢ x 107? m. 


5 
B=0.6 =3/5;7= — = 5/4. 
V 25 
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(ct, x) 
In $8: a= 10) 
4 
= (0. oo x 10) 
ctl, =0 (3.52) 
/ 5 —2 —2 
In T: eae 0O— =x =cx 10 =-—zcx 10 
th —ty =—= x 107? sec. 


The time interval t), — t', is negative. Therefore, B occurs before A. 

(c) The ground segment AB is at rest with respect to Mr S. He mea- 
sures the proper length L for this segment. In contrast, Mr T sees the 
ground segment AB to be moving to the left. Therefore, he measures 
the improper length L’ for the same segment. 

By Eq. (2.26): L = yL’. Hence L’ = L/y = 2.4 x 10° x 4 = 1.92 x 
10° km. 

(d) The train segment A’B’ is at rest with respect to Mr. T. He measures 
the proper length @ for this segment. In contrast, Mr. S sees the train 
segment A’B’ to be moving to the right. Therefore he measures the 
improper length L for the same segment. 

By Eq. (2.26): €= yL = 3 x 2.4 x 108 x - = 3x 10% km. 


Note that 4 =”, as per Theorem 3.1. 


Example 3.11. An event “A” = (0,0,0,0) is followed by a second event 
“B” = (1,2,0,0), 1 sec later, the coordinates being measured with respect 
to a frame S in It-sec. Is there a frame of reference S’ in which 


(a) “A” and “B” are simultaneous? 
(b) the event “B” is followed by the event “A”, 1 sec later, ie. ,—-t, = 1? 
(c) the events “A” and “B” occur at the same spatial location? 


Find the boost velocity c6 of S’ relative to S in each of the above cases (if 
such a frame S’ exists). 


Solution. AY = (1,2) in S. Hence, As? = 1— 4 = —3. The interval is 
space-like. 
(a) Yes. 

In the second frame At’ = 0. Hence, 0 = y(1— 6x2) +8 =#4. 
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Y Y oB 
Ss g 
x ca 
u= cB 
v A B 
aa | V : v=c a 
\ ‘BOX, Yo) 
iN = ae ‘B xX B _ 
P* : a 
fe + P : 
A® ve 


Fig. 3.11. Rod falling vertically. 


b) Yes. To find this frame, note that cAt/ = —1. Hence 
( 


-1l=7(1- 6x 2) 
Or, = =1-26 
Squaring both sides: 1 7 6? =14+ 46? — 48 
Or, 587-48 =0 
Or, B=0, 5 
8B =0, gives At = 1. Same order 


4 
B= . gives At = —1. Reverse order. 


(b) No, because the interval is space-like. 


Example 3.12. Consider a straight horizontal rod moving vertically down- 
ward with uniform speed V as seen from frame S’ (see Fig. 3.11). Its ends A 
and B will therefore strike the horizontal floor (represented by the X-axis) 
simultaneously according to S’. However, when seen from S, the ends strike 
the floor at different times, and, therefore, the bar is not horizontal. Find 
the angle that the bar makes with the horizontal (measure positive angle 
anticlockwise from the X-axis.) 


Solution. To find the angle ¢, we need to know the coordinates of the ends 
A and B at the same instant of time (i.e. simultaneously) in S. 
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We know that the two events 4p = “A touches P” and gq = “B 
touches Q” are simultaneous in $”. Their coordinates in S” and S are as 
follows: 


(ct, x), (ct, x), 
OaP = (0,0 in Ss", ABQ = (0, 2) in S', 
(0.0) in S, (ctzg,%eq) in S. 


Using inverse Lorentz transformation equation (3.9), we get the values of 
(ctsq,€pq) in S: 

ctag = y(0 + Be) = BE, 

Taq > y(e a 0) = 78. 

Let us look at the rod from the frame S. At ct = 0, the left end A of 
the rod is at the origin, i.e. at the spatial coordinates (0,0). Let the right 
end B of the rod at the same instant be at the spatial coordinates (ao, yo). 
At ctgo, the right end of the rod is touching the floor. Therefore, in time 
{ctso = 7B}, the end B has undergone a spatial displacement from (xo, yo) 
to (yé,0). 

Seen from S', all points of the rod have a velocity v. The horizontal 
and the vertical components of this velocity can be found from the velocity 


addition formula (4.23): vz = cB; vy =a —ct. We have set v as the 
dimensionless velocity, like 6. V = cv. 
Therefore, 


to + & ) (Bc) = yf > xo = ye(1 — 67) = 


yo + eo) (-c*) =0> y = Bev. 
c 7 


Therefore, tan ¢ = = Byv. 
to 


Example 3.13. Distant galaxies are known to be receding from ours at 
very large speeds, as evidenced by the shift in the frequencies of the spectral 
lines from atoms towards lower values, a phenomenon known as red shift. 
A distant galaxy is recognized to be receding at a speed of 0.5c along the 
line of sight, from the measurement of the red shift of the sodium Dg line 
of wavelength 5890 A. Find the measured value of the wavelength. 


Solution. Use the Doppler shift formula (3.31). Note that ~ = 4, and 
p=5i7= Rio= = 0°. Therefore, 


d= 7(1 + Bcos0°)A, = V3 x 5890 = 10, 201 A. 


Chapter 4 


Relativistic Mechanics 


The previous chapter elucidated the relativity principle by first enunciat- 
ing the basic postulates and then showing their immediate consequences. 
One of these consequences, namely the Lorentz transformation, has played 
a crucial role in reshaping classical mechanics by redefining the fundamen- 
tal quantities, like energy and momentum, and then modifying Newton’s 
equations of motion. We propose to sketch in this chapter some of these 
revolutionary developments and structure relativistic mechanics as an out- 
growth of Lorentz transformation. Central to this development is the veloc- 
ity addition formula, which forms the topic of the next section. 

We shall adopt the following convention. The boost velocity, i.e. the 
velocity of S’ with respect to S will be denoted by u = c and the velocity 
of a moving particle will be denoted by v = cy in S and v’ = cv’ in S$’. 


4.1. Relativistic Form of Velocity Transformation 


The relativistic velocity transformation formula, generally known by its 
popular name Velocity Addition Formula, is a relativistic generalization of 
the Galilean velocity transformation formula shown in Eq. (2.8c). We shall 
first obtain the simplest special case by considering the boost:$(cG, 0,0)", 
corresponding to the standard configuration shown in Fig. 3.1. In Fig. 4.1, 
we have shown a moving particle, and its radius vectors r and r’ and its 
velocities v and v’, both with respect to the frames S and S’. 

Let us consider a particle moving along a certain trajectory which 
appears as © to S and b’ to S’ (Figs. 4.1(b) and 4.1(c)). Consider two 
points A and B on the trajectory of the particle and lying infinitely close 
to each other. The radius vectors of these points are r and r+ dr in S$ and 
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Fig. 4.1. Trajectory of a particle in the frames S and 9’. 


r’ and r’ + dr’ in S’. These points are reached at times ¢t and t + dt in S 
and ¢’ and t’+dt’ in S’. We can, therefore, say that “the particle arrives at 
A” and “the particle arrives at B” are two events, which can be designated 
by the symbols “O4” and “Og”, respectively, and whose coordinates are 
as follows: 


(a ” (ct, r) = (ct, x, Y, z) in S, 
Oa” = BIN ae, oD pal aD PN Bes SIP (4.1) 
(ct', x’) = (ct, 2’, y', 2’) in 3, 
(ct + cdt,r + dr) 
“Op = = (ct + cdt,« +dz,y + dy, z+ dz) in S, (4.2) 


(ct' + cdt’,r’ + dr’) 
= (ct! + cdt', x2’ + dz’, y’ + dy’,z'+dz') in S’. 


The coordinate differentials transform according to Lorentz transforma- 
tion, as pointed out in Eq. (3.22). Therefore, 


cdt' = y(cdt — B dz), (4.3a) 

dx’ = y(dx — Bcdt), (4.3b) 

dy’ = dy, (4.3¢) 

dz' = dz. (4.3d) 

Let the velocity of the particle at the event “O” be v = es as it appears 


to S, and v’ = — as it appears to S’. The Cartesian components of these 


Relativistic Mechanics 83 


vectors are as follows: 
( ) dx dy dz 
V = (Uz, Vy, Uz) = | —, —, 
_— dt’ dt’ dt)’ 
/ / / 
(=a a| 2 |. 
a a dt'’ dt'’ dt! 
Note that, in order to determine the velocity components in a given frame 
of reference, we have used the length and time intervals pertaining to that 
frame of reference only. We can now utilize Eq. (4.3) to connect the two 
sets of velocity components as follows: 


foe dx! —_cy(du — Bcdt) 


(4.4) 


edt (edt — Bdx) 


_ ised (4.5) 


where u = c@. In the same manner, we can obtain the transformation 
equations for the other components. Writing them together, we get the 
velocity transformation for this special case, namely, boost: S(v,0,0)S9" as 


Vez — U 


cas 


Alternatively, if we had started from the inverse transformation 
equations (3.9), we would have obtained the following result, which is the 
inverse of Eq. (4.6): 


Equation (4.7) effectively shows the result of adding velocity u to v, whereas 
Eq. (4.6) tells us the result of subtracting u from v. These results are 
distinctively different from their Galilean counterparts. These equations 
are often referred to as the velocity addition formulas (subtraction of u is 
same as addition of —u). 

We have now to verify that the transformation equations (4.6) and (4.7) 
satisfy two essential requirements of all relativistic results, namely that (i) 
they must reduce to the corresponding non-relativistic result in the limit 
of small velocities; and that (ii) if a particle is moving with speed c in 
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the frame 5S’, then it should move with the same speed c in the second 
frame S', and vice versa (so that light propagates with the same speed c 
in both frames). With regard to requirement (i), let us record here the 
non-relativistic limit criterion 


Whenvu<c, 8-0, y>1 (N.R. limit). (4.8) 
Applying the criterion (4.8) to Eq. (4.6), so that oe — 0, one gets 
Up = Ve —U, Vy = Vy, UZ = Vz. (4.9) 


This is the Galilean result (2.5). 
To meet the requirement (ii), we set v’ = (c,0,0) in Eq. (4.6) and get 


c+uwu 0 0 
Vy, = =C, i = - = 0, Vz = =), 4.10 
1+% "14+ 1+ % on 


so that v = (c,0,0). 

We shall now illustrate the velocity addition formula by showing that 
the speed of light c is the ultimate speed, i.e. no material particle can be 
accelerated beyond the speed of light. Suppose a particle is moving along 
the X-axis with speed 0.99c relative to the laboratory frame S. Imagine 
a second frame S’ which is also moving along the X-axis with velocity 
0.99c relative to the Lab frame so that the particle is at rest in S’. The 
particle is now accelerated (from the present zero velocity in S’) to velocity 
0.99c along the X’-axis in this second frame $”. According to the velocity 
addition formula (4.7), therefore, the resulting velocity of the particle in 
the laboratory frame S would be 


_ 0.99¢ + 0.99¢ 
°° T+0.99 x 0.99 


Therefore, our velocity addition formula effectively tells us that if we add 
velocity 0.99c to 0.99c, we get 0.9999494c, instead of 1.98c. However hard 
we may try to increase the speed of a particle, which has nearly reached 
the speed of light, it will still be never able to reach, or surpass, the speed 
of light. 

We can make the argument more convincing by analyzing the motion 
of a particle which is under a “constant acceleration” a in its instantaneous 
rest frame (to be abbreviated as JRF) along the X-axis. The term instan- 


= 0.9999494c, along the X-axis. 


taneous rest frame — also called the comoving Lorentz frame — means an 
inertial frame in which the particle is, at least momentarily, at rest. We 
shall explain the concept of IRF with the diagram shown in Fig. 4.2. 
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Fig. 4.2. Explaining Instantaneous Rest Frame, shown as 9’. 


Imagine an accelerating particle, e.g. a rocket. Its own frame of refer- 
ence (i.e. the frame in which it is always at rest) is an accelerating frame. At 
a certain event, say “O: rocket passes a space station A”, the rocket releases 
an iron cage with X'Y’Z’-axes welded on it. We label this frame as S$’. This 
frame S’ is not in acceleration. Hence it is an IF whose velocity is the same 
as that of R at the event “O”. Then S$” is an IRF of R at the event “O”. 
We can establish a Lorentz connection between S and S’ through the boost 
{S(6,0,0)S’}, where 6 = v/c. The following paragraphs will amplify the 
concept of IRF. 

In Fig. 4.3(a), we have shown the rectilinear trajectory of a particle 
accelerating along the X-axis, as seen from the laboratory frame S. The 
particle passes space stations A and B with speeds v and v + dv at times t 
and t+ dt. Call these events “O,4” and “Op”, respectively. 

Figure 4.3(b) shows the IRF 5S" of the particle at the event “Oy”. In 
this frame, the events “OQ 4” and “Oj,” take place at the times t’ and ¢’+ dt’, 
the particle’s speed at these events being 0 and dv’, respectively. 


du’ = adt’. (4.11) 


According to the velocity addition formula (4.7), therefore, 


/ dv' 2 
vtde= EE = (dv! 40) (1-2 *) v4 (1-5) dv’, 


14+2% C2 
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(s , trajectory 
= trajectory ‘es 
a ewe tA XK zero dv’ 
A B Oo See: @>.-----\------ 
i d A B 
t+at r +d 
»¢ x 
Z Z 
View from the Lab frame View from the comoving Lorentz frame 


(a) (b) 


Fig. 4.3. Lab frame and comoving Lorentz frame. 
so that 
y2 
_ ee ! 
dv = (1 =) adt’. (4.12) 


Applying the LT to the coordinate differentials (using the inverse of 
Eqs. (4.3)), we get 


cdt = Ts (c dt’ + da") (4.13) 
CF 


Now, the particle, which was momentarily at rest at t’, has covered the 
distance dz’ under a constant acceleration a in the subsequent time interval 
dt'. Therefore, dx’ = $a(dt’)? ~ 0, so that, from (4.13) 


o] 


dt = ———. dt’. (4.14) 


We shall introduce a new Lorentz factor, namely one associated with the 
velocity of a particle, and call it dynamic Lorentz factor, to be symbolized 
by capital gamma: 


(4.15) 


Note that the time interval dt’ is measured in the IRF. Hence it is the 
same as the proper time dv between the events “O,4” and “Og”. By virtue 
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of (4.15), Eq. (4.14) can be written as 
dt =Tdt' =T dr, (4.16) 


which is the same as the relation between proper time and improper time 
given in Eq. (2.23). Equation (4.16) will find many applications in the 
discussions to follow. 

Compare the upper case [ of (4.15) with the lower case gamma + 
defined in Eq. (2.20) to represent the Lorentz factor associated with the 
boost velocity, which we had called boost Lorentz factor. 

We can now rewrite Eq. (4.12) as 


oe 
dv = (1 - =) adr (4.17a) 
c 


2\ 3 
= (1 = =) adt = ~ dt. (4.17b) 
c T 


It is seen from Eq. (4.17b) that the acceleration of the particle, as seen 
from the laboratory frame, is constantly reducing, and is given as follows: 


3 
dv y2\? a 
a ae ( =) aie oa 


Note that as the particle reaches relativistic speed, finally approaching 
c, T + oo, and apap > 0. Even at a constant acceleration in its IRF, the 
particle does not accelerate at all in the Lab frame, as it approaches the 
speed of light. The formula (4.18) therefore safeguards one of the principles 
of relativity, that is vu Sc. 

If we rewrite Eq. (4.17) as 


d 
—_ = adt (4.19) 
(Lam) 
and integrate, subject to the initial condition: v = 0 at t = 0, we get the 
expression for the velocity in the Lab frame S: 


= at. (4.20) 


Squaring both sides and simplifying, we get 
c 


(+ ge) 


a 


(4.21) 


Ne 
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The result shows that a particle under “constant acceleration in its 
instantaneous rest frame” will asymptotically approach the speed c in any 
IF, say S, but will never reach it. See Fig. 4.5 below. 

The presence of c? in the denominators of Eqs. (4.6) and (4.7) make 
them appear somewhat clumsy. To make them look neater, we shall rewrite 
them in terms of dimensionless velocities 3,v, defined as 


u=c§=c(6,0,0), v= cv=C(iz, Vy, Vz). (4.22) 


With these modifications, Eqs. (4.6) and (4.7) take the following dimen- 
sionless forms: 


/ Vy — B / Vy / Vz 
v= —_, vy, = — >, “4 = | 4.23a 
[8 598) ave 8S) 
py! +B VK, yp’ 
xz = 7 > —- fa. op FAN? Zz — — 4.23b 
“Tap a4 up)’ (+B) oe 
4.2. Relativistic Form of Acceleration Transformation 
Let 
dv dv : a 
Q = — = C— SOV = C (Vz, Vy, Vz), 
dt dt (4.24) 
2 Sree 
il gg Ne 


represent the acceleration vector and its Cartesian components in S' and 


S’, respectively. Note that the “dot” (") means 4 in the first line and 4 
in the second line. Then 
. dv’, dv! /dt 
= i= - | = , 4.25 
Mi Cal “edt! /dt’ oe tT oo) 
Differentiating (4.23b) with respect to t 
dv, _ (1 _ 6?) 
dt (1—v,8)? 
 y2(q — 2? 
, ee (4.26) 
diy 7 Vy (1 — ve 8B) + Vyte 8 
dt ya _ Vz)? , 
dt’ 
Using (4.3a) <— = ye(1 — Bz). 
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We wrap up (4.25) and (4.26) to obtain the following acceleration transfor- 
mation formulas: 


Se Vy, 

Va = y(1 _ Vx B)]>’ (4.27a) 
+, _ Yy(1 — ve 8) + YyteB 

6 ST maa pelt) 
ahs es v,(1 — VB) + VzVz3 

= ua oo 


Let us specialize (4.27) to rectilinear motion, along the X-axis. In this 


case, only the line (4.27a) is relevant. We drop the subscript “,.”, and write: 
. V 
vy! = ————_ __ for boost: $(8,0,0)S", 
[y — vB)]s 
; (4.28) 
Vy 
v = ———__ for boost: $’(—8,0,0)S. 
a+ BP Pe 
In the second line, let S’ represent IRF. Then v’ = 0. We get 
yop 
Y = _-DlUc el 4.2 
us 73? (4.29) 


as in (4.18), except that now the acceleration cv’ in the IRF is variable. 


4.3. A New Definition of Momentum 


One of the consequences of the GT, as we saw in Sec. 2.2.1, is that the 
acceleration a of a particle is the same in all inertial frames. This invariance 
of acceleration is lost under the LT, as shown in Eq. (4.27). An immediate 
consequence of the frame-dependence of acceleration is that if we express 
Newton’s second law of motion as F = ma, and treat the mass m as 
invariant, in line with Newtonian assumption, then force also becomes frame 
dependent. A constant force F in one inertial frame would then appear as 
a variable force in another inertial frame. 

A more serious consequence of the velocity addition formula is that the 
laws of conservation of energy and momentum, which were seen in Sec. 2.2.2 
to be valid in all inertial frames under the GT, now appear to break down 
under the LT. It turns out that if the defining expressions for energy EF and 
momentum p are modified to meet the requirement of the ultimate speed 
c, then the frame independence of the conservation laws can be restored. 
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Fig. 4.4. Symmetric collision 


In the rest of this section, we shall consider a thought experiment to 
seek a pointer to the definition of linear momentum. With this in mind, 
let us consider the following example. An observer S watches an elastic 
collision experiment with two billiard balls A and B, each of mass m,, and 
assumed to be moving with relativistic speeds (Fig. 4.4(a)). For the sake 
of simplicity, we assume the collision to be taking place on the XY-plane. 
The velocities of the ball, before and after the collision, are denoted as 
u,4,Ug and vy, vp, respectively. We have assumed that the initial speeds 
of the two balls are equal and that they make equal angles with the X- 
axis. A straight forward application of the energy-momentum conservation 
equations (2.10) and (2.11) would lead to the symmetry relations: 


UA =UB=VA =UB. (4.30) 


Of the above four velocities, only v4 has both X and Y components 
positive. We shall write these components as cv, and cv, respectively, so 
that v4 = (V¥z,V,) is the dimensionless velocity of A after the collision. 
Referring to Fig. 4.4(a), we can now write the following components for the 
velocities involved: 


ug =C(Vz,—Yy), VA = C(Vz, Vy), 
i 7 (4.31) 
up =C(—’z,%y), VB = C(—Vz,—Vy). 


Now imagine another observer in a frame S’ which is moving relative 
to S with speed ua, = cv, in the direction of the X-axis (Fig. 4.4(b)). 
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Using the velocity addition formulas (4.23), we obtain the following com- 
ponents for the velocity vectors in S’: 


ul _ UWAr — UAx oe 
‘Ag = 79 9 = SY 
e 1 = fe 1-v2 
! WAy Vy 
LAy = =F 9 7 — C, 
70 J) 70-7) 
vy! = VAx — UAx = Vy — Vy — 
ae 1 —vartar/C? 1-1? , 
v4, = ee et Cc 
Ay (l= tastas/A) 1-12)” 
(4.32) 
ul, = UBsx — UA — —2Vy, “ 
ae l—vpetac/C?  14+027 
/ UBy Vy 
tp, = =, = C, 
w > GL tee). La we) 
/ UBa — UA —2Vy, 
UBe = To Tr 
1 = vV4ntAn/C 1+ 
ae ee ee 
By 9(1 — tet sa /C?) ql ue) 
where 
: (4.33) 
Via 
Hence, 
| ee Vy ) i ( Vy ) 
UA+=C 0, ’ VAzC 0, ’ 
( yl 92) yl) 
(4.34) 
: c Vy F Cc Vy 
2p, mae) 
uB 2 ( 4), VB + ( x ) 
We shall now put the non-relativistic momentum expression: 
p=m,v_ (Not Valid in Relativity) (4.35) 


to the momentum conservation test in the frame 5S” (conservation of 
momentum and energy in S had been assumed for obtaining Eqs. (4.30) 
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and (4.31)). From Eqs. (4.34), the total momentum of the system in the 
frame S’, before and after the collision, are seen to be as follows: 


p’ Gnitial) =m, (u’, ce u’,) 


Vy Quy, 
=mc|— = 
TER I) 
PO) = m,(v'4 + V5) 


2Vy QV, 
=m.c 4 —-——, —— 7 - 
1+v2’ 7(1— v4) 


Thus, P/@"2) 4 p/(nitial | sueeesting that the momentum conservation 
law breaks down if we adopt the definition (4.35). This calls for a revised 
definition for linear momentum. 

One may get a clue along the following lines. The transformation 
u= (4. ou +) ou = (4, a +) due to change of reference frame is 
a double transformation in the sense that the LT converts both the numer- 
ator dx and the denominator dt according to Eq. (4.3). It might have been 
better to adopt instead of the coordinate time dt, some other measure of 
the time interval which would remain invariant under the LT. The natural 
choice would be the proper time interval dr of the particle’s own motion. 

Referring to the trajectory of the particle shown in Fig. 4.1, we can think 
of a clock attached to the particle and being carried with it. The timing 
read on this clock is the proper time of the particle’s own motion (proper 
time, because the particle’s clock is present all along its own trajectory). 
This clock reads the timings at the points A and B (i.e. at the events 
Ox, and Og) to be 7 and t + dr, respectively. Therefore, the proper time 
interval between O,4 and Og is dr. Dividing the spatial intervals between 
these events by this proper time, we get a new measure of the velocity, 
which we shall call proper velocity and write as Vprop. The components of 
this velocity in the frames S and S’ will be 


dx dy dz ; dx’ dy’ dz' 
rop — re oer ome ner ee ) Trop — 3 ed ’ 4. 
Neen (F dt =) Bess ( dr’ dr’ dr 338) 


respectively. 
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Since a comoving inertial frame of the particle at the event O.4 is moving 
with velocity v with respect to S, and v’ with respect to S’, it follows from 
Eq. (2.23) that 


dr = — = (4.37) 


where 


— 
— 


—— (4.38) 


Loe tae 


are the dynamic Lorentz factors in the frames S and S’, respectively, as 
defined in Eq. (4.15). Therefore, from Eqs. (4.36) and (4.37), 


Vprop =P, V'prop = I'v! (4.39) 
Hence we try the following definition for momentum: 
P = ™M.Vprop = P'm.v. (4.40) 


For non-relativistic speeds, [ —> 1, so that the definition (4.40) converges 
to (4.35). 

We shall now show that this new definition (4.40) will ensure momen- 
tum conservation in the example illustrated through Fig. 4.4. For this we 
shall first compute all the I-factors, in the S’ frame, corresponding to the 
velocities written in Eq. (4.34). It is easily seen that the I-factors corre- 
sponding to u’, and v’4 are equal and therefore can be represented by a 
common symbol I“,. Similarly, the [-factors corresponding to uz and v’g 
are equal and can be represented by another symbol I’. These factors can 
be computed with the help of Eqs. (4.34) and (4.33) as follows: 


1 i vw ; ve , ve 1-v-v 
T3(v,) 2  O-v2)j2 1-2 1” 
1 v's 1 v2 
oo 28 oi ee a 
PZ) onl eto 
1 4 2 2 2), 2 
= G12: [(l+ v2 + 2v2) — {402 + (1 v2)vi }] 
(=) = 2 = 22) 
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Therefore 
1l-yp (1 — vz) 
Le ee ae 
(4.41) 
/ Lee “(1 eee) 


The new definition (4.40) yields the following momenta for A and B 
before the collision. 


initia 1— 
pi tal) Pum, = hs) (0. <n ;) m,C 


a eee 
1—ve Vy 


=| 0,- M,C 
Li 

piinitial) i i iti os y(1 + v2) i “) ne 
1- y2 = p2 iD 1 + Vy 

© y 

_ — 2We 
4 fl-2— ja 
Hence, the initial momentum of the system is 
/(initial) 2 
P = 0 > me. (4.42) 


/ 2 2° 
Lys =v; 


Similarly, the momenta of the particles after the collision are as follows: 


2 
(final) y(1— vz) Vy, 
Pa =Tav4m,c = —=—— — (0, 70-2) Mec 


a ae 
1—v2 us, 


= | 0, ——=— _.] m,c 
gim-m= vy 
2 
/(final) Pcp y(1+ vz) : 4) Moe 
p =Tpvg™.¢ = ——— | - 21, -— 
7 L=2=2 : l+uz 
2 
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Hence, the final momentum of the system is 


2YV x 


/ 2 2 
C=, 


From Eqs. (4.42) and (4.43), we find that the new definition (4.40) of 
linear momentum will conserve linear momentum in both S$ and S’. In 
Sec. 4.6, we shall prove this statement for a more general collision, and in 
Sec. 8.4, we shall arrive at the same definition of momentum in a more 


pi(fnal) — 0] mae. (4.43) 


natural way while discussing 4-momentum. For the time being we shall 
confirm this new definition through the following equation: 


p=Im.,v =I'm,cB, (4.44) 


where v is the velocity of the particle. We have used a subscript 0 under m 
to underscore the fact that m, is the intrinsic mass, often also called the 
rest mass of the particle, for reasons that will be explained. 

Frequently, the ['m, is called the relativistic mass. We shall reserve the 
symbol m (i.e. m without subscript) to mean relativistic mass: 


n=Im, = ——. (4.45) 


In contrast with m, the rest mass m, measures the inertia of the particle 
when the particle is momentarily at rest, i.e. the inertia of the particle when 
the particle is either at rest, or is being accelerated from its (momentary) 
rest position. We can alternatively express momentum in the old style: 


p= mv, (4.46) 


where the mass m appearing now is, however, relativistic mass. It is a 
variable mass and its relation to the invariant rest mass m, is given by the 
expression (4.45). 

Later in this chapter (see Eq. (4.79)) we shall show that the “total 
energy” FE (sum of rest energy and the kinetic energy) of a particle is 
proportional to the relativistic mass m. In fact E = mc?. Therefore, m is 
also a measure of the particle’s energy. Hence a plot of m vs. ( is also a 
plot of E vs. B 

Variation of momentum p and energy FE with velocity has been plot- 
ted in Fig. 4.6, below. In this figure, we have indicated the relativistic 
momentum p using the relativistic formula (4.44), and the non-relativistic 
momentum pyr, obtained by using the N.R. formula (4.35). It can be noted 
that the two plots are almost identical in the region 6 X 0.3, suggesting 
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that we can use N.R. formulas for momentum and energy all the way upto 
0.3 c, without causing appreciable error. 


4.4. Force 


Newton’s second law of motion is a law as well as a quantitative definition 
of force. Following this law, the force F acting on a particle is measured as 
the rate of change of its momentum p: 


(4.47) 


where m is the relativistic mass as defined in Eq. (4.45). 

With the definition of force as adopted in Eq. (4.47), the proportionality 
between force F and acceleration a is lost. In particular, F and @ are no 
longer parallel. We shall establish a relation between the two, as shown in 
Eq. (4.82) below. 

We shall now illustrate the second law by working out the effect of a 
constant force F on a particle of rest mass m, which is at rest at t = 0. 
Since the resulting motion is one dimensional, we shall drop the vector 
symbol. From Eq. (4.47), 


wy =F, aconstant. (4.48) 
dt 
Integrating, and using the initial condition: p = 0 when t = 0, we get 
p= Ft. (4.49) 
However, from Eq. (4.44), 
p=I'm,v =T6m,c, (4.50) 
where 6 and [ are defined in Eq. (4.15). From (4.49) and(4.50), 
F 
ra (4.51) 
m,C 


Also, analogous to Eq. (2.22), we have 


a age oa (4.52) 
1— 6? 
Therefore, from (4.51) and (4.52), 
1 M,C) 2 
ga=it{ st. (4.53) 
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Simplifying, we get 


v =c3 = ———___. (4.54) 


moc\2 
1+ (48) 
Note from Eq. (4.54) that as t > 00,8 > 1. That is, the velocity of the 
particle approaches c asymptotically with time. Also note that the result 
shown in Eq. (4.54) is identical with that in Eq. (4.21), if we set a= £. 


Mo 
The convergence of the two results suggests that a relativistic particle under 


a constant force is undergoing a constant acceleration in its instantaneous 
rest frame. 

We can integrate Eq. (4.54) to obtain an expression for the distance « 
covered by the particle in time t: 


t t t cdt 
r= fvd=ef sar= | 2 (4.55) 
: : o i+ (He) 


or 


2 F 2 
ioe 1+( “) =i (4.56) 


We can make the formulas (4.54) and (4.56) look less formidable by 
defining a characteristic time 


T= : (4.57) 


Formulas (4.54) and (4.56) will now take the following easier looking forms: 


v=ch= ———. (4.58) 
1+) 
t 2 
C= 6T, 1+ (=) —1]. (4.59) 
To 


All relativistic formulas should satisfy one important test, namely they 
should yield the corresponding old familiar results in the non-relativistic 
limit: 

N.R. limit => v<c, 6-0, Tl. (4.60) 


On the other extreme we have the ultra-relativistic limit. 


UR. limit> vec Bo1, Too. (4.61) 
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In the present case, the criterion (4.60) is realized during the early times 
t < 7, when the speed gained by the particle is still small compared to 
the speed of light, and the momentum Ft imparted to it by the force F is 
small compared to m,c. Applying this criterion to Eq. (4.56), it is easily 


seen that 
iy 1/F 
i -1/=- ar. 4.62 
+3(=) ! | 2 (=) ( 62) 


Here F/m, is the acceleration a, so that we get back our familiar 
non-relativistic kinematic result: 2 = at?. 
If we apply the approximation (4.60) in Eq. (4.54), we get 


which is again the old familiar non-relativistic result. 

Before leaving this context it can be useful to carry a “numerical feel” of 
the relativistic formulas just derived, by considering two extreme examples 
of commonly encountered forces. In the first example, we shall consider 
a particle subjected to a constant force equal to the force of gravity (as 
experienced near the surface of the earth). The force on the particle is 
then F = m,g, where g = 9.81 m/s’. Therefore, the characteristic time is 
woe. 8K 10° m/s 


To = SS 


Fg 9.81 m/s? 
= 3.06 x 10’ sec = 354 days ~ 1 year. 


If we measure time in years, then Eqs. (4.58) and (4.63) will give: 


t 
B = - = Jlee (General result), (4.64) 
B=t (NR. result). (4.65) 


The pre-relativistic result (4.65) tells us that the speed of the parti- 
cle should increase linearly and would equal the speed of light in 1 year, 
twice that speed in 2 years, and so on, and will keep on increasing without 
limit. The relativistic formula (4.64), on the other hand, predicts a linear 
increase in speed in the beginning, followed by an asymptotic approach to 
the ultimate speed c at the end. It predicts a speed of 0.71c at the end of 1 
year, 0.89c after 2 years, 0.95c after 3 years, and so on, so that the rate of 
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Fig. 4.5. Velocity @ and displacement x as functions of time t under a constant force. 


increase slows down as time progresses. The v—t relationship is illustrated 
in Fig. 4.5. 

We have just seen that it takes 1 year to reach 71% of the speed of light 
under a force equal to the terrestrial force of gravity. Therefore, if a rocket 
engine keeps firing continuously to generate a thrust equal to the weight of 
the space vehicle for one full year, it will come somewhat close to the speed 
of light. The above example therefore serves to illustrate how impossible it 
is for a space traveller to attain a speed anywhere near that of light. 

If, however, the above example suggests that special relativity has noth- 
ing to do with terrestrial experiments and down-to-earth applications in 
physics, then the following example may correct such a notion. 

Linear accelerators of earlier designs like the Van de Graff accelerator 
and Cockcroft Walton accelerator were used mostly to accelerate protons, 
deuterons, alpha-particles and other ions. Each of these machines can be 
designed to generate a potential difference of several million volts across the 
length of a tube called the accelerating tube. The charged particles required 
to be accelerated are allowed to fall through the potential difference between 
the ends of the tube. 

Let us consider a Van de Graff accelerator having a length of 1m and 
a potential difference of 5 millions volts. We shall use this machine to 
accelerate an electron. We shall, therefore, need to calculate the velocity 
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attained by the electron after traversing the length of the tube and the time 
required to cover this distance. It is given that the rest mass of an electron 
is m, = 9.11 x 1073! ke. Charge of the electron is e = 1.6 x 107!* coul. 
Therefore, the force on the electron is 


F=eE=16x107%x5x10°=8x 107 N. 


The characteristic time is 


_ 9.11 x 1079" x 3 x 108 


eo = 3.416 x 107'° sec. 


To 
From Eq. (4.58), we can obtain the speed of the electron at different 
times as follows: 


v=0.707e at t=7, = 3.416 x 107! sec, 
v=0.89¢ at t= 27, = 6.832 x 107! sec, 
v=0.95e at t=37, = 10.248 x 107" sec. 


We have plotted 6 vs. t and x vs. t in Fig. 4.5. The time axis has been 
calibrated in the unit of 7,. In Fig. 4.5(a), we have indicated the velocity 
achieved after time intervals of T,,27,,37. In Fig. 4.5(b), the vertical axis 
represents displacement x, and has been calibrated in the unit of ct). We 
have indicated the displacement x at times 7), 27,370,470, 57>. It can be 
seen that after time t = 27,, the x — t graph is almost a straight line. 

The time required to cover the distance of 1 m can be calculated 
from Eq. (4.59) using the value of 7. It works out to be t = 10.717 = 
36.59 x 10~-'° sec. The velocity of the electron after accelerating through 
this distance is now obtained from (4.58) to be 0.99565c. 

Note that the electron gains a speed equal to 99.56% of the speed of 
light by falling through a potential difference of 5 million volts. The kinetic 
energy gained by the electron in this process is 5 million electron volts, 
or 5MeV. We shall take up the concept of relativistic energy in the next 
section. 


4.5. Energy 


Let us review how an expression for kinetic energy can be derived in 
non-relativistic physics. For simplicity, we first consider a particle of mass 
m in one-dimensional motion under a constant force F’. The acceleration is 
a = F'/m and is a constant. If the particle starts from rest from the origin 
and moves along the X-axis, then we have the familiar kinematic equation 
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for the velocity v reached after traversing a distance x: v? = 2az, so that 
Fe =maxr = mv. 

The quantity Fa is the work done by the force F' in displacing the 
particle through a distance x. Using the conservation of energy principle, 
one may argue that this quantity F’x of potential energy must have been 
lost by some agent (which may a gravitational or electric field, a machine, 
human muscle, or something else) and has been transferred to the particle 
in the form of its kinetic energy. Therefore, the kinetic energy K of the 


particle is as follows: 


kK = work done in raising the velocity of the particle from 0 to uv 


1 
=Fe= zm (N.R. result) (4.66) 


For obtaining the same result from a more general force F which has arbi- 
trarily variable magnitude and direction, we proceed as follows: 


K = work done = f Pde =m f vat 


=m [veav= 5m [aw-vy= [a (Sm), (4.67) 


In the above equation, the integration is carried out over the path along 
which particle has moved. Since the velocity of the particle is assumed to 
be zero at the beginning of the path, the above integration gives the same 
non-relativistic result, namely 


1 
K= zm (N.R. result). (4.68) 


We shall now apply a similar argument to a relativistic particle. To 
start with, we first consider the simplest example, namely a constant force 
F applied on a particle along the X-axis. The work done in moving the 
particle over a distance x is, as before, Fx. In this process, the velocity of 
the particle increases from 0 to v. It is seen from Eqs. (4.53) and (4.52) 
that 


[mt] =G-1=a 
Fil 62 262° (4.69) 
1262 41=T?. 


Combining the above two relations, we get 
Ft } 


m,C 


r2=1+ | (4.70) 
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It now follows with the help of Eq. (4.56) that 
K = Fx = (T —1)m,c’. (4.71) 


One can derive the above relation for a more general case. Consider 
the applied force F to have arbitrary magnitude and direction so that the 
particle will now move along a more general curvilinear trajectory. Using 
the defining equations (4.47) and (4.44) for force and momentum, we get 


ay) - vdt. (4.72) 


W =work done = [ P-dr=m, [ 7 


The right-hand side of (4.72) will readily transform to the form shown 
in (4.71) with the help of the following identity: 


dTv) od 
Bo vaeo. (4.73) 
To prove Eq. (4.73), we first note that 
il v? 
Differentiating either side, we get 
2c? dP d(v?) 
a 4. 
TS dt dt ae) 
Therefore, 
d(Tv) dv dv 9dr T'd(v?) 
v= —Vv- To evayre— st 
ae a a a a 
a |eae |e 
7 FT? || de- 
Also note from (4.74) that v?+ is =c?. Hence the identity (4.73) is proved. 
(QED) 
From Eqs. (4.72) and (4.73), we have 
p 
Y= me | dl = (T —1)m,c’. (4.76) 
1 


Here we have used the fact that at t = 0,v = 0 so that [ = 1. The kinetic 
energy is therefore given by the earlier expression (4.71). We rewrite this 


Relativistic Mechanics 103 


as follows: 
K=W =(P-1)m.c’. (4.77) 


We shall show that the above expression for kinetic energy indeed 
reduces to the non-relativistic expression (4.68) under the condition (4.60) 


1 1 1 
T-1 =. - 12°14 =6?-1= =8?. 4.78 
Ji-# tar 3° ee) 
Hence, 
Be 22 1 2 
k=(T-1)m,c gineB —— meu . (N.R. result) (4.78b) 


The quantity E = K + m,c? is called the total energy E of a “free 
particle” having velocity v: 


(4.79) 


where m is the relativistic mass, as defined in Eq. (4.45). In contrast with 


2 


the total energy mc?, the quantity m,c? is called the rest energy, or the rest 


mass energy of the particle. When v < c, the above equation reduces to 
1 
E=K+m,c2 ~ zou +m,c? (N.R. formula). (4.80) 


Note that & = kinetic energy + mass energy, always, even in the N.R. 
limit, as we shall find in many applications of the energy formula in the 
sequel. 

In Sec. 4.6, we shall present a satisfactory explanation why mc? is looked 
upon as the total energy of a relativistic particle. 

It is seen from (4.79) that the I-factor represents the ratio between 
the total energy and the rest energy of a particle. Note from Eqs. (4.60) 
and (4.61) that for a non-relativistic particle T ~ 1, and the total energy 
is almost exclusively its rest energy. For an ultra-relativistic particle (see 
the examples in the following paragraphs), [ >> 1. Thus, the [-factor is 
sometimes used as an index of the relativistic “magnitude” of high energy 
particles. 

In Fig. 4.6, we have plotted variation of momentum p and the energy EF 
with velocity. 
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Fig. 4.6. Momentum and energy as functions of velocity. 


The rate of change of [is proportional to the power @ delivered by the 
force F, as we can see using Eqs. (4.77) and (4.72). (You can pronounce 


the symbol @ as “varpi” — a variant of 7.) 
dw dk dE dV 
F. —— = —_ = _ a 4.81 
o a dt a ee) 
The relation F-v = oe is valid when the rest mass mg of the particle 


moving under the force F remains constant. We shall in future encounter 
a situation when this assumption will not be valid. We shall then avoid 
Eq. (4.81). 

As we had remarked following Eq. (4.47) that F and a are not parallel. 
We shall obtain a relation between the two, using the relation (4.81), and 
the definition of force and momentum as given in (4.47) and (4.46): 


=ma+—. (4.82) 


The following relationship between the total energy E and the momen- 
tum p is extremely useful: 


E? = p* + m2ct. (4.83) 


In the N.R. limit, the above formula approximates to the formula (4.80) 
as the reader can easily verify. 


o] 
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To prove (4.83), we shall use Eq. (4.79), the identity (4.69), and 
Eq. (4.44). 


BP aT?mié = (1412 mic = mic +p? (QED) 


To specialize Eq.(4.83) for a massless particle, like photon, we set m, = 0 
and get the corresponding energy-momentum relation 


E=cp, for a photon. (4.84) 


As a simple application of Eq. (4.83), we shall calculate the momenta 
and the velocities of the radioactive particles emitted during the beta-decay 
of a typical gamma-emitter (commonly used in college physics laboratories), 
namely °°Co. ©°Co emits a beta-particle of kinetic energy 0.31 MeV, which 
is followed by two gamma-rays of energy 1.33 MeV and 1.17 MeV, respec- 
tively. The rest energy of the beta-particle (i.e. electron) is known to be 
m,c? = 0.51 MeV. Therefore, from (4.79), FE = 0.31 + 0.51 = 0.82 MeV 

Hence, pe = \/ E? — m2ct = \/(0.82)? — (0.51)? = 0.64 MeV. 

If one uses MeV/c as a unit of momentum, then p = 0.64 MeV/c. 

To compute the velocity of the emitted electron, we shall first compute 
the [-factor, and use the following relation (which follows from Eq. (4.74): 


T?-—1 
ga (4.85) 
The following estimates are obtained using (4.79) and (4.85): T = — - 
0.82 


= 1.61; 6 =0.78. 
The above example shows that the velocity of the beta-particles emitted 
by ©°Co and other radioactive isotopes are fairly relativistic. In the present 
case, the velocity of the emitted beta-particle is v = 0.78c, that is, about 
78% of the speed of light. It also follows from Eq. (4.83) that the momenta 
of the two photons emitted in the radioactive decay of °°Co are 1.33 MeV/c 
and 1.17 MeV/c, respectively (because the rest mass of a photon is zero). 
As a second example, we shall consider the beta decay of 34Cl which 
decays by the emission of a positron of energy 4.5 MeV. As before, E = 
4.50 + 0.51 = 5.01 MeV. 


Hence, pe = 1/(5.01)” — (0.51)? = 4.98 MeV, so that p = 4.98 MeV/c. 


0.51 


5.01 
Also, [ = 357 = 9.8, implying highly relativistic nature of the emitted 


positron. Using Eq. (4.15), one now gets @ = 0.995. That is, the velocity 
of the emitted particle is 99.5% of the speed of light. 
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The above examples serve to illustrate that relativity is not just a 
Utopian idea, far removed from real life encounters. Even undergraduate 
students routinely experiment with electrons and positrons travelling with 
almost the speed of light, in their college laboratories. At the same time 
it should also be remembered that it is not so easy to obtain heavier par- 
ticles, like protons, alpha-particles and heavy ions, at relativistic speeds. 
The following example will illustrate this point. 

Consider an alpha-particle emitted from the radioactive decay of the 
isotope 7!?Po. It has a kinetic energy of 8.78 MeV. This is about the highest 
energy with which any particle is emitted from natural radioactivity. The 
rest mass energy of the alpha-particle is m,c? = 3727.23 MeV. 

Therefore, [ = THmg® — 8.78+3727.28 — 1 924. 

Thus, [ ~ 1, so that the emitted alpha is non-relativistic. Since G < 1, 
we can use the approximation (4.78), and obtain the velocity of the alpha- 
particle approximately as follows: 


B~ /2(P — 1) = 0.0693. 


The speed of an alpha-particle of energy 8.78 MeV is now seen to be about 
7% of the speed of light. 

Consider protons that have been accelerated to an energy of 1 TeV, 
which is equal to 10! ev. Such particles are ultra-relativistic (T = 1067). 
We shall establish the relativistic nature of a more modest energy proton, 
say, one of energy 10GeV, i.e. 10*MeV. The rest energy of a proton is 
m,c? = 938.247 MeV. Therefore, E = 10938 MeV. One can calculate the 
momentum and velocity using the relations (4.83) and (4.85). The final 
answer is as follows: p = 10.9 GeV/c; T = 11.66; 6 = 0.996. 


4.6. Energy-Momentum Conservation Law 


In Sec. 4.3, we proposed a new definition for momentum, through Eq. (4.44), 
with a promise to provide a general proof later that such a definition would 
ensure Lorentz invariance of the momentum conservation law. (The term 
Lorentz invariance would mean preservation of a certain relationship, law 
or equation after the corresponding quantities have been transformed by 
Lorentz transformation, following a change of the frame of reference.) At 
the time of defining total energy of a particle through Eq. (4.79), we also 
promised to justify the nomenclature “total energy” in a subsequent section. 
We shall now redeem these pledges. Perhaps, the reader will be surprised to 
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discover that, unlike in non-relativistic physics, the conservation of momen- 
tum and energy are not two disjointed concepts, but that one implies the 
other. If we demand that momentum be conserved in all frames of refer- 
ence, that itself will guarantee conservation of total energy, and vice versa. 

Before establishing the Lorentz invariance of the conservation laws, it 
will be necessary to establish the Lorentz transformation of energy and 
momentum. We shall, for simplicity, consider boost: S'\(c6,0,0)S’ as repre- 
sented by the configuration shown in Fig. 4.1(a). Let v = is and v/ = ae 
represent the velocity of a particle of rest mass m,, as measured from the 
Lorentz frames S' and S$’, respectively. Then, according to the defining 
Eqs. (4.44) and (4.79), its momentum and energy in these two frames will be 


p=mJv, E=m Jc’, in. 
(4.36) 


p=mJl'v’, EF’ =m,I"'c?, ins’, 


where I’ and I” represent the Lorentz-factors associated with the velocities 
of the particle in S and S$’, respectively. It follows from Eq. (4.37) that 


dr 1 dr- r dr 


os ses —, 4.87 
di dr’ ir en 
Hence 
3 a. da dy dz E_ cdt 
Pee ON ae de ae)? @. ar’ 
(4.88) 
; dr’ dx’ dy’ dz' E! Per 
=m,—_ = m = fa ho —_—- =m ¢ 
r ° dr °\ dr’? dr’ dr)’ e¢ a 


Using the Lorentz transformation equations (4.3) for the coordinate 
differentials, the momentum and the total energy in the frame S$” are now 
expressed as a linear combination of both of these quantities in S (for 
convenience we shall use E’/c, instead of E. Note that E/c has the same 
dimension as momentum p). 


on tt ney (4 — 928) 5 (, — 9 
Pa = dr eT ie Gy) ce)? 


Py =Py, Ds =Pz, (4.89) 
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Fig. 4.7. Collision of two particles. 


The inverse of the above transformation follows without much ado: 


Py = Py; 
4.90 


A comparison with Eq. (3.8) shows that the set (E/c,pz,py,pz) trans- 
forms, under a change of the frame of reference, in the same way as the set 
(ct, X,Y, Z). 

With the above preparation we now reconsider the collision phe- 
nomenon, earlier discussed in Sec. 2.2.2, once again. Figure 4.7 (which is 
a reproduction of Fig. 2.2) describes the setting. Two particles A and B 
come and collide and result in the emergence of two other particles C and 
D from the scene of the collision: 


A+B 3C+D. 


This kind of process occurs frequently in the sub-atomic world, as illustrated 
in the famous example 


qa +p oA+K°, 


in which a pi-meson collides with a proton to create a K-meson and a 
lambda particle. 

As in Sec 2.3, let us assume that the total momentum of the system, as 
measured in the frame S, is the same before the collision as after it. Since 
momentum is a vector, the above statement implies conservation of its x, y 
and z components separately. Using the definition (4.44) of momentum, 
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we get the following three equations: 
PAx + PBx = PCxz T PDz; 
PAy T PBy = PCy T PDy; (4.91) 
PAz + PBz = PCz + PDz- 


In the above, pax stands for the x-component of the momentum of the 
particle A,B, etc. Using the transformation equation (4.90), we can now 
write the above momentum components in terms of quantities measured in 
the frame S’. This results in the following equation: 


E! E! 
7 (o's. ~ o=4) +7 (v's + s=8) 


c (6; 
EI iE 
= (0%. +9=8) +H (0. + s=2) ; 
or 
E’ E Et E’ 
/ / / / A B Cc D 
+} 6({—4+#8----—) =0. 
(DAx PBz PCx Pox) a( a 7 e a é ) 


Since 6 is arbitrary, the above equation separates into the following two 
equations. 


/ eo / 7 
Hale = Hab Ep as 
Pax + PBe = Pcs + Ppa: 

We note from Eqs. (4.91) and (4.92) that conservation of the « component of 
linear momentum in the frame S implies conservation of both momentum 
and total energy in the frame S’. We would have come to the same conclu- 
sion for the other components of momentum had we started this exercise 
from a general boost. We therefore summarize as follows. Conservation of 
momentum in any one frame implies a wider conservation law in all other 
frames, which brings within its fold not only momentum conservation, but 
also energy conservation, that is, conservation of “total energy” as defined 
by Eq. (4.79). Momentum and energy become inseparable from each other 
in relativity, a fact we shall see more transparently in Sec. 12.3. 

We shall rewrite this very important concept in the form of more 
general equations for a fuller comprehension of its significance. Suppose 
there are n particles, which are marked for identification as 1,2,...,n. 
They are allowed to interact. Following the interaction, they may either 
retain their original identities as particles numbered 1,2,...,n, or may 
change into N other particles, which may now be marked by numbers 
n+1,n+2,...,n+N. Considering the more general case of N particles 


110 Special Relativity, Tensors, and Energy Tensor 


resulting from the interaction (N = n, and the particle marked 7 is identical 
with the one marked 7 + n in the special case where the particles do not 
change in the interaction process), we can now use the definitions (4.44) 
and (4.79) to rewrite the conservation laws, as implied by Eq. (4.92), in 
any general frame of reference S, as follows: 


n n+N 
So mol ivi = yy moilivi, (4.93a) 
i=l i=n+1 
n n+N 
So moiF ie? = x mol ;c?. (4.93b) 
i=1 i=n+1 
In the above Tj = = =. Equations (4.93a) and (4.93b) together rep- 
7 ge 


resent conservation of four quantities namely pz, Py,pz and E. One lesson 
from Eq. (4.93 b) is that it is times mass, and not mass alone, which is a 
conserved quantity in relativity. (The word “mass” will always mean rest 
mass m,, unless relativistic mass m is explicitly implied.) Conservation of 
mass, a guiding principle of chemistry, wilts away under the onslaught of 
relativity: 


n n+N 
S- moi # > moi (in general). 
i=1 i=n41 


Mass before reaction # Mass after reaction (in general). 


The reader should now see the reason for calling E = mc? = T'm,c? the 
total energy in the definition (4.79). It is this 'm,c? which is conserved in 
all natural processes. Since (T — 1)moc? has already been shown to be the 
kinetic energy, the balance m,c? is supposed to be the potential energy of 
the particle manifested in the form of its rest mass. This rest mass potential 
energy can be converted into kinetic energy, or some other form of energy. 
Nature provides an abundance of such examples. 

Most of the practical examples of rest mass potential energy being con- 
verted into kinetic energy lie in the domain of nuclear physics. An awe- 
some example is the fission of uranium. The 7°°U nucleus captures a slow 
neutron and spontaneously breaks up into a number of smaller fragments, 
releasing in this process an enormous amount of energy. This fission reac- 
tion is utilized not only in the detonation of the atom bomb, but also in 
the extraction of useful energy in most of the nuclear power reactors. Even 
though the products of this reaction are not the same in different fissions of 
the same 2°°U isotope, they generally consist of two medium weight nuclei 
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(one of them with a mass number between 85 and 104, and the other with a 
mass number between 130 and 149), along with about two to three neutrons 
and about six to seven electrons. One typical such reaction is the following: 


2357, +n + 9° Mo +9 La+7e7 + 2n. 


The right-hand side of the above reaction has less mass than the left- 
hand side, so that mass has not been conserved in the reaction. The energy 
equivalence of the difference between the rest mass before the reaction and 
the rest mass after the reaction is the energy released in a single fission. 
It goes into the kinetic energy of the fragments. When there are some 
1073 nuclear fissions taking place in a fraction of a second, the resulting 
fission fragments will be dashing about with their own shares of the released 
energy. The net effect is then a violent liberation of heat. 

In the table below, we have listed the parents as well as the products 
of the above fission reaction, their rest masses in atomic mass unit, and 
converted the mass difference into energy, so as to provide a numerical 
illustration of how mass is converted into kinetic energy.* We have used 
the conversion factor: 1 a.m.u. x c? = 931.16 MeV. 


ce 
Before | U-235 235.0439 
n 1.0087 


236.0526 219,802.73 


After | Mo-95 94.9058 
La-139 138.9061 
2n 2 x 1.0087 
7e 7 x 0.00055 


235.8331 219,598.39 


Difference | 0.2195 204.34 
(0.09% of the original 
mass energy) 


As another example of mass-energy conversion, we shall consider a 


fusion reaction, in which four hydrogen nuclei, i.e. protons, fuse to form a 
helium nucleus: 


Ap —» *He + 2et, 


“A good description of the fission and the fusion processes have been discussed by 
Samuel Gladstone [12]. The mass values have been taken from the same book. 
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where e* represents positron, having the same rest mass as an electron. 
We have the following table. 


[ [Particle [Wass ann) [me Gn Me) 


Before | 4p 4 x 1.0078 
= 4.0312 3753.692 


After | He-4 4.0030 
2e 2 x 0.00055 
| Total | 4.0031 | 3727.526 526 


a 0.0281 a 166 
(0.7% of the original 
mass energy) 


We have provided more examples in the suggested problems at the end 
of this chapter to give the reader a thorougher familiarity with the energy- 
momentum conservation principle. 

If, the examples cited so far have given the reader a wrong notion that 
these principles are applicable only in physics and not in chemistry, then 


we shall consider the example of the hydrogen atom to clear this misunder- 
standing. Actually the mass of a hydrogen atom is less than the mass of its 
constituents, namely, a proton and an electron. However, this difference, 
converted into mass energy, is only 13.6eV. Compared to the rest mass 
energy of the hydrogen atom, which is about 938.36 MeV, this 13.6eV is 
very negligible, only about 1.45 x 10~°% of hydrogen’s rest energy. This 
mass is so negligibly gained, or lost, in every chemical reaction, that a 
chemist does not need to pay any attention to it. However, it will be useful 
to remember that when a candle burns, or ice melts, these processes are 
accompanied by a very small change in the masses. The end products are, 
lighter in the first example and heavier in the second one, although the 
differences are so tiny that it will be very difficult to measure them. 


4.7. The Centre of Mass of a System of Particles: 
The Zero Momentum Frame 


The centre of mass (C.M.) of a system of particles plays an important role 
in non-relativistic classical mechanics,” especially in a many-body system, 


bSomnath Datta [8, Chapter 12]. The CM has been defined on p. 470, the rigid body 
motion, especially the precession of a spinning top with diagrams on pp. 513-521, the 
two-body system with several applications on pp. 521-533. 
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like the motion of a rigid body (which is made up of a very large number 
of atomic particles), and in two-body systems. like the Earth—Moon pair 
moving in the gravitational field of the Sun. 

The CM frame of such a system is the inertial frame in which the CM 
is stationary, even though the individual particles may be moving with 
arbitrary velocities. Also the number of the constituent particles, and their 
velocities, may differ following a reaction or a collision, and yet the CM of 
the system will move on with a constant, unchanged velocity. 

The relativistic counterpart of the classical CM is defined along parallel 
lines, except that the invariant mass of each constituent particle is now 
replaced by the relativistic mass of such particles.° 

Let us consider system of N particles, having rest masses, radius vectors 
and velocities {moi, ri, vi; 7 = 1,2,3,...,N}, with respect to an inertial 
frame S. We define the location rey, and the velocity Vem of the CM as 
follows: 


N 
_ Deizo MT (4.94a) 


N 
gs Pit Pa OG (4.94b) 


where, m; =Tmo; = the relativistic mass of the particle i, (4.94c) 


N 

P= y m,V; = Total momentum of the system, (4.94d) 
i=0 
N 

M= y m, = Total relativistic mass of the system.  (4.94e) 
i=0 


For zero mass particles, like photons of light frequency v, m; = hv/c?. 

Since P and F are each conserved for an isolated system, these two 
values should remain unchanged following a chemical or nuclear reaction. 
Therefore, it follows from Eq. (4.94b) that the velocity vem should remain 
unchanged in such a reaction. In order to clarify the concept of CM, and its 
uniform motion in the relativistic case, we have worked out a few examples 
in Problem 4.7. 


©Rindler [4, Example 6.5, p. 126]. 
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The zero momentum (ZM) frame is the relativistic analogue of the CM 
frame of N.R. mechanics. It is the inertial frame in which P = 0. 

To make a transition from the Lab frame to the ZM frame one has 
to make a Lorentz transformation of the energy-momentum 4-vector. We 
have taken up this topic seriously in Sec. 8.8.1. 


4.8. The Twin Paradox 


An example which is often cited for illustrating the relative aspect of time 
is the Twin Paradox. It is a riddle around the ageing rates between the 
brother who stays home and his counterpart who undertakes a space voyage. 
But then, why should there be any ageing difference, given that all inertial 
frames are equal? The clue lies in recognizing that one of the brothers is 
necessarily in an accelerating frame. 

There are several versions of this paradox (see References) most of which 
allude to a very brief but violent acceleration at the beginning, at the turn- 
around point, and at the end of the journey. This tends to take away atten- 
tion from the accelerating aspect which is the crux of the riddle. Therefore, 
we shall choose a version of the paradox (see J D Jackson Problem 11.4) in 
which the acceleration is moderate but steady and lasts through the entire 
duration of the voyage. 

Ram and Sam are twins who have been living in an inertial space lab S 
(shown as inertial frame S$ in Fig. 4.8) since childhood. Ram being the more 
adventurous of the two, decided to take a space odyssey when they were 
20 years old. He boards a rocket R (shown as frame R) which accelerates 
at a constant rate a with respect to a comoving Lorentz frame R. After a 
time span of n years, as recorded on his watch, Ram, wanting to come 
back, starts decelerating at the same rate a for another n years to reach 
zero speed with respect to S. Then he turns around, accelerates homeward 
at the same rate a for n years, then decelerates for another n years and 
safely lands at their old Space Lab to reunite with his brother Sam. Ram 
is now 20+ 4n years old. How old is Sam? 

Let us first set up the basic differential equation required for finding the 
answer. 

Let P and Q be two adjacent points along the path of the rocket at 
distances x and « + dx from the starting point O of Ram’s journey. Let v 
and v + dv represent the velocities of Ram with respect to S while passing 
P (“event P”) and Q (“event Q”), respectively. Let Ro(Qp) represent the 
co-moving Lorentz Frame at the event “Op”. We shall assign the following 
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coordinates to the above pair of events with respect to the three frames of 
reference. 


ct,z) in S 

“Op” = ¢ (ct’,0) in Ro(Op) ?, (4.95) 
cr,0) inR 
ct+cdt,u+dzr) in S 

“Og” = ¢ (ct! + cdt', dz’) in Ro(Op) >. (4.96) 
ct + cdr, 0) in R 


Noting that the frame R is moving with infinitesimal speed at the event 
“Og” relative to the frame Ro(Op) it follows that dr = dt’. Therefore, 
from Eq. (4.17a) 


> =adr, (4.97) 
-@ 
which is the required differential equation for this problem. 
In Fig. 4.9 we have shown the path followed by Ram. We have demar- 
cated his route into four segments namely, outward acceleration along OA, 


Y 
x 
4] 
Ls (I Vv 
_— PQ 
— o—e , 
= tTT4+dt < Rocket time 
t t+dt < Lab time 
O | | x 
zs dx 


Fig. 4.8. Sam’s frame 5 and Ram’s frame R. 


A B 
Oe =O =< 
De= e< e 

Cc B 


Fig. 4.9. Round trip route of Ram. 
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deceleration along AB, turning around at B, homeward acceleration along 
BC, followed by deceleration along C'D, and finally return home at D. 
Let us consider the first segment of his journey, namely to O to A. 
Let t and 7 be the times measured in the frames S and R, respectively, 
corresponding to the event that “Ram has travelled a distance x from O” 
on this segment of his tour. The velocity of Ram at this point is obtained 


by integrating Eq. (4.97). 
| ae af dr, (4.98) 
o. b= 0 


or tanh =< (4.99) 
c 
Noting that 
dt = ydr, (4.100) 
where 
1 1 
a a ae 4.101) 
1-% 4/1—tanh? = c 
dx 1dzx 
that =— = -— 4.102 
so that v= = mae 02) 
d: 
we get “ =ctanh cosh = esinh ™. 4.103) 
dt c c Cc 
Integrating again, 
T 2 
£= ey sinh dr = — [cosh = 1 ‘ (4.104) 
0 c a c 
To find t we integrate Eq. (4.100) and get 
: : ar c aT 
t= i ydt = i cosh —dr = — sinh —. (4.105) 
0 0 Cc a Cc 


Let the time recorded on the clock of Ram at the instant when he has 
reached A be 79. Then the time and space coordinates (ct4,a,4) of the 
event “Ram has reached A”, with respect to the frame S$, are obtained by 
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replacing T by 79 in Eqs. (4.104) and (4.105). 


2 

ie [cosh ees. (4.106a) 
a CG 

ta = <sinh 2. (4.106b) 
a Cc 


The velocity of Ram at this point follows from Eqs. (4.99): 


va =ctanh 2. (4.107) 
(é 


The equation of motion for the second leg, i.e. AB, is obtained by 
replacing a by —a in (4.97). Integrating we get 


Vv d T 
/ — =-a | dr. (4.108) 
VA 1 . ra TA 


Now making use of Eq. (4.107), we get 


a(2T) — T) 
Cc 


eS = a] 


v = ctanh | | T| <T < 270, (4.109) 


y = cosh » TOS<T< 27. (4.110) 


To find the corresponding time measured in S, we integrate as in 
Eq. (4.105) 
t 
/ a= | ydt (4.111) 
ta TA 


a(2T|) — T) 


and get 


t= 214 Ssinn | | 7) <7 < 21. (4.112) 
a 


Cc 


To obtain x, we proceed similarly and get: 


/ ax = f yu dr, 
LA To 
2 


ie 
or L= = 2 cosh (=) — cosh (=) - | ; 
a c c 


|] <T< 27. (4.113) 


Setting 7 = 79 in Eqs. (4.109), (4.110) and (4.111), we get the coordi- 
nates and the velocity of Ram at B: 


tp = 2t a, LB= 22 A, UB = 0. (4.114) 


118 Special Relativity, Tensors, and Energy Tensor 


The return journey is a mirror reflection of the outward journey. Hence, 
the duration and the furthest distance of Ram’s space odyssey, as measured 
by Sam are as follows: 


: (4.115) 
bao: [cosh (=) - 1 
c 
In contrast, the travel time, as measured by Ram himself, is 
To = 47. (4.116) 


For a numerical feel of the problem, the reader may consider a to be equal 
to the acceleration due to gravity near the surface of the earth, and take 
T) to be equal to 5 years, so that Ram returns home at the age of 40. This 
will then lead to the following estimates: 


D=16x10"%km!! T = 335 years !!!! 


4.9. Compton Scattering 


As an illustration of the energy-momentum conservation laws, we shall cite 
one example of historical significance in atomic physics. It is the scatter- 
ing of X-rays by atomic electrons resulting in a shift in the wavelength of 
the scattered X-rays — an effect known as Compton Scattering. Since the 
outer electrons (with binding energy of a few electron volts) of the atom 
are chiefly responsible for Compton scattering and since the X-ray pho- 
tons have comparatively higher energies (Ky X-ray photons from Tungsten 
have energy of 69.83keV), these electrons are assumed to be at rest (i.e. 
zero energy) before collision — an assumption that simplifies calculations 
greatly. Figure 4.10 shows the scattering mechanism schematically. The 
incoming photon is scattered at an angle —0@ with the direction of incidence 
(which is taken to be along the X-axis), after hitting an electron, which is 
knocked off at an angle of ¢. 
The energy of a photon of frequency v is given as 


E(v) = hy, (4.117) 


where h is Planck’s constant. The magnitude of the momentum of the same 
photon (whose rest mass is zero) follows from Eq. (4.84) to be 


piv) = - (4.118) 
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Before collision After collision 


Fig. 4.10. Compton scattering. Schematic diagram. 


Let us first write down the momentum conservation equation. The momen- 
tum of the photon, before and after the collision, is given by the vectors 
Pi and pg respectively, whose magnitudes are, according to Eq. (4.118), 

_— hvyz = hv2 
PS 3 2 Se 
before and after the scattering. If p. is the momentum of the recoiling 
electron, then the momentum conservation statement is that 


, where 11,12 are the frequencies of the X-ray photon 


P1 = P2+ Pe, 
or Pe = P1 — P2. 


(4.119) 


Squaring either side, 
2 ee 
De = Pi + Pz — 2pip2 cos @. (4.120) 


Now we shall obtain the energy conservation equation. Letting the rest 
mass of the electron to be m,, and using Eq. (4.78), we get the initial and 
the final energy of the system as 


2 
E\nitial = hvy “Ir Ee rest =pic+tm,c, 


Egnal = hve + Eemoving = poe + o/m2ct + pec?. 
The energy conservation therefore means 
pic+m,c? = poc + \/(m,c2)? + p2e2. 
Simplifying we get 


pe = (pi — p2)” +2 (r1 — p2) mec. (4.121) 
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From (4.120) and (4.121): 


(p1 — po)” + 2 (p1 — po) mc = p? +p} — 2pipecos@, —(4.122a) 


or (pi — p2)m,c = pipe(1 — cos6), (4.122b) 
n\2 
or h(vy—Vv2)m, = (=) V1V2(1 — cos), (4.122c) 
c 
1 i h 
or ¢ (= - +) = (1 — cos @). (4.122d) 
Zp) VY, MoE 


From Eq. (4.122c), we get the change in the gamma ray energy: 


hvy hv2 
Ce 


hi, — hig = (1 — cos @). (4.123) 


° 


From Eq. (4.122d), we get the change in wavelength, using the fact that 
AV =: 


h 
AX = d2- 1 = (1 — cos 8), (4.124) 
MC 


) 


where Aj, A2 represent the wavelengths of the incident and the scattered 
X-rays. 


The factor a appearing in the above equation is called the Compton 


wavelength and is denoted by the symbol A,. Its value can be computed: 


h 6.63 x 10-34 
= O_O 243 x 107 m = 0.0243 A. 
m,c (9.11 x 10-31) x (3 x 108) . = 


de = 
(4.125) 


The kinetic energy T. transferred to the ejected electron is the same 
as the change in the photon energy. To find this energy we first solve 
Eq. (4.123) for hv: 


m,¢c? hivy 


hig 1068) 
Ro + (1 — cos) 1+ (=) iis 


1 —cos6)h 
Therefore Ty = hv, — hry = _( en) (4.126b) 


2 + (1 — cos6) 


hvy 
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We shall now obtain a relation between the scattering angle @ and 
the electron ejection angle ¢, from the momentum conservation equation 


(4.119): 
Pi =Pii, pz = p2(cos i — sin j), 
Pe = Pe(cos di + sin dj), (4.127a) 


Pe Cosd =p, — p2cos@ (conservation of z-momentum), (4.127b) 


pe sing = p2sin@ (conservation of y-momentum). (4.127c) 
ee Pi/ Po — cos 0 m aii — cosé 
sin 0 sind 
hi, 1 —cos@ 
=(1 ———_}. 4.12 
( ss — ( sin 0 ) eda, 


To obtain Eq. (4.127d) we used Eq. (4.126a), and made some algebraic 
manipulations. 

We have plotted T, vs. hv, and @¢ vs. 6 in Fig. 4.11. It is seen from 
Fig. 4.11(b) that the electron is deflected by the angle 7/2 at 6 = 0, whereas 
common sense would suggest that for a direct hit, the target particle should 
recoil in the same direction. It is now seen from Fig. 4.11(a), and also 
evident from Eq. (4.126b), that the electron’s share of the incident y-ray 
energy goes to zero as 9 — 0. For a zero energy particle, the deflection 
angle can be anything. However, at back scattering of the y-ray, i.e. at 
§ = 7, the deflection angle is zero, which is what is expected. 


T,(MeV) o 


4 > ee) 
BZ m/2 3n/4 Tt 
0 0.2 0.4 0.6 0.8 | 1.2 14 1.6 1.8 2 (b) 
(a) hy, (MeV) 


Fig. 4.11. Compton scattering. (a) Electron energy vs. incident photon energy for four 
values of photon scattering angle (indicated by the side of each graph). (b) Electron 
ejection angle vs. photon scattering angle for four values of photon energy in MeV 
(indicated by the side of each graph). 
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4.10. Summary of Important Formulas 
Lorentz factor for boost. See Eq. (2.20). 

y= > where 6 = =. 
Lorentz transformation from S to S”. See Eq. (3.8). 
ct’ = y(ct — Ba), 
x! = (x — Bet), 
y =y, 


zz =£2. 


Velocity addition formulas. See Eq. (4.6) 


/ Uz — U / Vy / Uz 
Uz = —_ Uz UW ? Vy = 1 Urzu\? Uz = 1 Ug U 
C2 Y ( ~ e ) Y ( c? ) 


Other important formulas. 


1 
Dynamic Lorentz factor T= Vin B=, (4.15) 
Momentum p=lmov, (4.44) 
Relativistic mass m = Tm = —, (4.45) 
‘~ 2 
dp 
Equation of motion F= aT (4.47) 
Kinetic energy K = (C —1)moc’, (4.77) 
Total energy E=K+moc? =I'moc? = mc’, (4.79) 
Relation between p and EE? = c?p* + mact. (4.83) 


4.11. Worked Out Problems IT 


Problem 4.1. A particle P decays into two smaller particles D, and D2.¢ 


P+D,+ Dz, 


See [13]. 
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(a famous example being A + p+ 77; also see the example suggested in 
Example 4.5). Let M be the mass of the parent particle and let m1, m2 be 
the masses of the daughters (“mass” will mean “rest mass” , unless otherwise 
stated). Energy consideration requires that M > mj, + mag, so that 


Q=(M—m —ma)c (4.128) 


is the energy liberated in the reaction in the rest frame of P (or the disin- 
tegration energy). 


(a) Let py; and £; represent the momentum and energy of D; in the rest 
frame of P. Using energy and momentum conservation laws and Eq. (4.78), 
show that 


2 
M2 ae 2 2 
ed ee 4.129) 
M 2 2 
p= {eae be (4.130) 


(b) Note that p; = po. However, Eq. (4.129) does not exhibit explicit sym- 
metry between m and mz. Therefore, recast Eq. (4.129) into the following 
symmetric form: 


pam = LRT FPR I, a5) 


(c) Hence, show that 
Oat r, (4.132) 


where 7), T> are the kinetic energies of the daughters D,, D2, respectively, 
in the rest frame of P. 


(d) Use the above equations to the observed decay of pi-meson (7*) to 
mu-meson (j:+) and neutrino (v,,). Take the masses, in unit of MeV/c? as 
follows: M, = 139.6; m, = 105.7; m, = 0. Calculate the kinetic energies 
of w* and v, in the rest frame of zt. 


References: Jackson [13], Griffiths [14]. 
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Solution. (a) Use energy-momentum conservation in the rest frame of P: 
E,+F,=Me?, pitp2=0. (a) 
Hence, FE, = Mc? — Eo, (b) 
or \/p3c? + mict = Mc? — \/pkc? + m3c4 
= Mc? — \/pic? + msc. (c) 


M? Se fs 
Squaring, simplifying, \/p7c? + m3ct = i a (d) 
M2 DF need 2 
or pe = {S| — mic’. (e) 


Also, E? = p?c? + mic* 


M? 4+ m2 —m?)c)” 
{{ 3 i) 2 — m3ct + mic 


2M 
ie . 
_ Ey (M? + m3 - m)| , (f) 
M 2 2 
Hence, Ey = ‘S + aa c. (g) 
M m2—m 
Similarly, By = ‘> + — c. (h) 


(b) By algebraic manipulation, we rewrite (e) as follows: 


= e? {M? — (m1 + ma)?} {M? — (m1 — m2)? } 


_— me 
: 4M? ~ Po 


showing symmetry between m1 and mg. 

(c) Q = M — my, — m2 = mass excess. 
@ 
2M 

or ann = M? +m? — m3 —2Mm, = (M —m)* — m3 
= (M — m, — m2)(M — m1 + m2) = Q(2M — 2m, — Q), 
Q(2M — 2m ~Q) 9 


T = Ey, -—mc = (M? + m3 — m7) —mc?, 


rT, = 
— IM 
_ Q(2M —2m2-Q) » 
lake t= 
Similarly, T> oar c 
2M +2(M —m,—m2— 
Adie Tag COMA oe 8) Be 


2M 
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(d) M=139.5MeV/c?; | m,=105.7MeV/c?; m2 =0MeV/c?; 
Q = 139.7 — 105.7 = 33.9 MeV/c?. 


Q(2M —2m-Q) o_ Qo _ 33.9? 


ae 2 Ea 
! 2M 9M ~ 2x 139.6 ay 
7, Q2M-2m2~Q) 2 Q2M~@) » 
2M 2M 
33.9 x (2 x 139.6 — 33.9) 
= ON + 29.78 MeV. 


Problem 4.2. A particle is moving with velocity v along the XY-plane, 
making an angle 0 with the X-axis, as seen from the frame S. 


(i) Find the magnitude of the velocity of this particle in the frame S’ 
under the boost:$(cG,0,0)S". 

(ii) Show that the angle 0’ of this velocity with the X’-axis is given by the 
angle transformation formula: 


; ysin@ 
= ——______. 4.1 
tan 0 y@eonO 8) (4.133) 


(iii) Specialize to the case of a photon, and show that under the above 
boost, the magnitude of the velocity does not change. However, the 
angle of inclination changes to 


sin 0 
tan 6’ = ———___.. 4.134 
and = T(cosd — 8) eres 


(iv) Modify the above equation to obtain the aberration angle ¢@ of a 
starlight which comes at an angle ¢ in the Sun’s frame (frame of the 
“fixed stars”). See Fig. 4.12. Show that 


tang! = 


(4.135) 
Hint: Set 0 = —(7/2+4+ ¢). 


Solution. The velocity of the particle in the frame S is v = v (cose, + 
sinfe,). Let v’ = v'(cos@’e, + sin6’e,) be its velocity in the frame S$’. 
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o 
4e78 I y(n*0.1,.90FZ 3 
B=0 
1.0492 ~~ ge OP LSS SD ie eassteo ss 42 
; Ro} 
O- f : ~ 
; ' S 
: 0.6 { ' x 
/ 0: § 
os 0. sas 
523599 K-- 4-1-4 46-2 ee ee eee ele eee eee eee eee 
ne fou 0.523599 - 1 
S77 0=-(m/2+9) (a) 
ie 11 
pt 0.0 
0 1 
0 0.523599 1.0472 1.5708 
1 2 3 


in unit of 1/6 
(b) 
Fig. 4.12. Aberration of starlight. (a) The angles ¢,¢’ made by the starlight in the 


Sun’s frame S and in the frame S’ moving with speed c8 with respect to the Sun. 
(b) Plot of ¢’ vs. @ for values of 8 = 0,0.1,0.2,...,0.9. 


Setting v = cv, and using velocity addition formula (4.23), 


ycos@ — B 
a 1— Br cos@’ (a) 
ysin@ 


‘sin Of = ———____ b 
ia (1 — By cos)’ (b) 


2 ycos@ — 6 7 ysind 
ee (a5) {eo} (©) 


Z Pi =v*) 
=e (1 — By cos 6)?" (d) 
ee. ysind 
tan#’ = >(v cos 0 — 8)’ (f) 


Answers to parts (i) and (ii) are given in Eqs. (d) and (f). That to part 
(iii) is obtained by setting v = 1. 

We have explained the light aberration problem in Fig. 4.12(a). In 
Fig. 4.12(b), we have plotted @ vs. ¢, making tick marks on the axes 
at intervals of 7/6. 
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Earth moves along its orbit with a speed of 30km/s, which means 
8 =10-*. This makes the ¢/ = ¢. In actual terms, ¢/ & 6 = 10~* radian, 
for ¢ = 0, which is approximately equal to 20 arcseconds, when the star is 
at the zenith. 


Problem 4.3. Let one of the daughters of Problem 4.1, say Dz, be a par- 
ticle of zero rest mass (e.g. photon, neutrino). (One typical such reaction 
is the transition of an atom or nuclei from an excited state to the ground 
level, or any lower level, accompanied by the emission of a photon.) Show 
that for this case 


(M2 — m2) 


pic = ogre (4.136a) 
M2+ 2 
B= (emi) (4.136b) 


Solution. Use Eq. (4.130). Set mz = 0. 


Problem 4.4. Let an atom (or a nucleus) make a radiative transition (i.e. 
the process is accompanied by the emission of a photon) from an excited 
state of energy € (measured from the ground state) to the ground state. 
Let the mass of the atom (or the nucleus) at its ground state be Mp. Using 
Eq. (4.130), or starting from energy-momentum conservation equations, 
show that the frequency v of the emitted photon is given by the relation 


€ 
hv =e |1- ———————- 4.137 
V € | Qe x ea ’ ( ) 
where h is Planck’s constant. 


Solution. Using Eq. (4.130), setting m; = 0, Ey = hv, M = Mo +e/c’, 
mez = Mo, we get the answer. 


Problem 4.5. An object of mass m, and moving with momentum p; col- 
lides and coalesces with a stationary object of mass mg. Show that the 
mass M of the compound object thus formed is given by the relation: 


M?e? = mic? + mac? + 2ma/ pic? + mict. (4.138) 


Solution. Let E,, E2 represent the energies of the objects before collision. 
Let P,E,M represent the momentum, energy and mass of the coalesced 
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object (after collision). Using Eq. (4.83), we get 


Ey = Jmict + prc: E> = moc?. 
Energy Conservation: E = FE, + Ey = \/mict + pic? + mec’. 
Momentum Conservation: P= p,. 
Using Eq. (4.83) again: M?c? = E?/c? — p? 


= mic? + mac? + 2m /pic2 + mict. 


Problem 4.6. In a {p+ “Li >® Be* + a+ a} reaction, protons of energy 
3.00 MeV fuse with a stationary “Li nucleus to form a compound nucleus 
8Be*, which is the nucleus of ®Be in an excited state. The compound 
nucleus has a short life time ~107!°s. It breaks up into two a particles, 
moving in opposite directions.° 


(a) Determine the mass, momentum and recoil velocity of the compound 
nucleus in the Lab frame. Is this velocity relativistic? 

(b) What is the energy level of the compound nucleus (i.e. the energy of 
the nucleus, in MeV, above the ground level of ®Be)? 

(c) Find the kinetic energy, momentum and velocity of each a particle in 
the rest frame of the compound nucleus. 


Solution. 
Answers to Part (a). We have explained the nuclear reaction in Fig. 4.13 
In the following, we shall use the symbol “u” to mean atomic mass unit. 


8p * 
Seen from the rest frame of Be | 
k 


* 
(Ve? =2( we2+k) 


Seen from the rest frame of Ti 


me? 4K + Mc? = (Y02+K =0 


(a) 


Fig. 4.13. Energy conservation in the nuclear reaction: {p+ 7Li + ®Be* > a+ a} 
reaction. (a) the reaction {p + 7Li—> ®Be*}; (b) the reaction {8Be* > a+ a}. 


°See [15, p. 203]. 
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We shall use Eq. (4.138) in which particle # 1 will represent the proton, 
particle # 2, the Lithium nucleus “Li. The energy of the incident particle, 
i.e. the proton, is non-relativistic, being only 3 MeV compared the rest 
energy of the proton which is = 1 u.c? = 931.48 MeV. 

We shall write (m, p) for the mass and the momentum of the proton, 
for the mass of the target “Li nucleus, @* for the mass of the compound 
nucleus ®Be* in the resulting excited state, #1 for the mass of the same 
nucleus in its ground state, and set 


6M =a" —(M +m). (4.139) 


Equation (4.138) now becomes 


gi’? = m? + M2 42M (4 ) +m? (4.140) 
c 


= (m+ M)?—2Mm+2M (2) + m2, (4.140b) 
or [Al + (m+ M)\[ar* — (m+ M)] + 2mM 
=2M ()" 4+ m2, (4.140c) 


or, [2(M +m) +5M|5M +2mM =2M,/(* ) 4+m2. (4.1404) 
Cc 


Let K represent the K.E. of the proton. Then, in this N.R. case 


2 ImK 
K=5;5(*) ==>. 
‘/ ae r (4.141) 
However (=) =z <1 
G MC 


GP mfie(Ey=m(ve3(Z)). aay 


Therefore, from (4.140d) 


2 
[2(M +m) + 6M]6M +2mM =2M4/ (4) +m?, — (4.143a) 
G 


1 2 
or, 2(M+m)5M+2mM ~2Mm (1 +5 (+) ) (4.143b) 


MC 
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2 
or, 2(M+m)é6M =Mm (=) 
MC 
2 


or, 6M = Won (+) ~ Tos (=) 


= Mm k 
~ M+m \me2)° 


Now we collect some mass values from [16]. 


u = latomic mass unit = 931.481 MeV/c’, 
1u x c? = 931.481 MeV, 
C2 
931.481 
= 9.662 x 10’? u.(m/s)’, 


1 MeV = 


= 1.07356 x 107? x c?u.(m/s)? 


m = m(proton) = 1.00782 u, 


M = m(‘Li) = 7.016004 u, 
$A = m(®Be) = 8.005305 u, 
pb = m(a) = 4.0026 u. 


Go back to (4.143d), taking K = 3 MeV as given: 


K | 3 
mc? 1.00782 x 931.481 


Mm _ 7.016004 x 1.00782 7.0709 
M+m _— 7.016004+ 1.00782 8.02382 


= 3.1957 x 1073, 


= 0.88124, 


56M = 0.88124 x 3.1957 x 107? = 2.8162 x 107? u. 


Therefore, from (4.139), mass of the compound nucleus ®Be*: 


su" = (M +m) +5M = 8.02382 + 2.8162 x 107? = 8.02662 u. 


(4.143c) 


(4.143d) 


(4.144a) 
(4.144b) 


(4.144c) 


4.144d 
4.144e 


— 


4.144f 


—_~ 


4.144¢ 


—_~ 
Sa SN Rs Ne Re 


4.144h 


(4.145) 


(4.146) 


Let us write (E,K,P,V) for the (total energy, kinetic energy, momen- 
tum, recoil velocity) of °Be*, following the reaction. Since the energy of 
this nuclide is in the N.R. domain, we can write E = K+ * c?. See (4.80) 
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and the comment following (4.83). Due to conservation of energy: 


K(p)+(M+m)?=K-+ sc’, 
or K = K(p) + {(M +m) — #"}c? = K(p) — 6M? 
= 3 — 2.8162 x 10-3 x 931.481 = 3 — 2.623 = 0.377 MeV. 
= 0.377 x 9.662 x 10'3 u.(m/s)? = 3.6426 x 10! u.(m/s)? 
[used (4.144 d)], (4.147) 


P= V2" K = \2 x 8.02662 x 3.6426 x 1018 


= 75.8475 x 10° = 2.418 x 10’ um/s, 
P 2.418 


= A —-8.02662 


x 107x3x10° m/s= 


c 
100° 


The domain of relativistic dynamics starts at ~ c/4. See Example 1 in 
Sec. 9.7. Hence the recoil velocity V is non-relativistic. 


Answer to Part (b). We shall find the difference between the mass #" of 
®Be* and the mass # of ®Be. 
6M =m" — (M +m) = 2.8162 x 10-3 u, 
AM = (M +m) — = (7.016004 + 1.00782) — 8.005305 
= 18.519 x 107? u, 
(sa* — #) = (6M + AM) = (2.8162 + 18.519) x 107? 
= 21.3352 x 107° u, 
(ga* — M)c? = 21.3352 x 1073 x 931.481 = 19.87 MeV, (4.148) 
which is the energy level of the compound nucleus ®Be* above the ground 


state of ®Be. This result concurs with the value of 19.9 MeV found in [15, 
p. 203]. 


Answer to Part (c). Let us compare the mass of two a-particles equal to 
2u. with the mass of *Be, equal to Al. 


Qu — #1 = 2 x 4.0026 — 8.005305 = 8.0052 — 8.0053 = —0.00001 u 
= —0.0931 MeV /c?. 
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Compare this with the energy level (—0.096 MeV) for 2a shown in [15, 
p. 203]. The small discrepancy can be due to error margins of the mass 
values taken from [16]. 

Let us follow up by energy conservation of the decaying nucleus °Be* 
in its rest frame. Let k(a) stand for the kinetic energy of each a particle 
(they are equal, as they move apart in opposite directions with equal and 
opposite momenta). 


A" c* = 2(uc? + k(a)), 
or, 8.0266 = 2 x (4.0026+ k(a)/c”), from (4.146) and (4.144h), 


1 
or, k(a)/e? = 5 x (8.0266 — 8.0052) = 0.0214/2 = 0.0107 u, 


or, k(a) = 0.0107 x 9 x 10° = 9.63 x 10** u.(m/s)? 


9.63 x 10! 
~ saeco =9.97MeV [we used (4.144d)] 


(4.149) 


Let us compare k(a) with yc?. From (4.144d) and (4.144f), pc? = 
4.0026 u.c? = 4.0026 x 931.481 = 3728 MeV. Thus, k(a) < pc?, and the 
N.R approximation is valid. See the comment following (4.147). Therefore, 
we can write 


p(a) = \/2uk(a) = V2 x 4.0026 x 9.63 x 10! = 8.78 x 10’ u.m/s 


pia) 8.78 x 10” u.m/s 7 2 
2 eS 1 ee 
are 4.0026 u oe MRS ee 


(4.150) 


The a particles emerging from the decay of °Be* are far below the 
relativistic domain. 


Problem 4.7. In this exercise, we shall compute the velocity of the zero 
momentum frame (Sec. 4.7) using three examples. 


(a) Consider the first part of the reaction given in Problem 4.5, i.e. {p + 
"Li 38 Be*}. Find the momentum and the velocity of p impinging on 
the Li nucleus with kinetic energy 3 MeV. 

(b) Find the velocity of the CM before and after the reaction, in the Lab 
frame. 

(c) Consider the second part of the reaction given in Problem 4.6, i.e. 
{8Be* + a+ a}. Find the velocity of the CM before and after the 
reaction, in the ZM frame. 
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Answer to Part (a). In the following equation, the subscript “,” represents 


proton. 


Pp = 2MpKp = V2 x 1.00782 x 3 x 9.662 x 1018 


= 2.417 x 10’ u.m/s, 


_ Pp _ 2A17 
Mp, 1.00782 


Up 


i 
x 107 = 2.398 x 10’ m/s 0° (4.151) 


Answer to Part (b). Let us first find the relativistic masses of p, “Li, °Be* 
in the atomic mass unit u. 


1 1 


Ta oR 


Cc 


= 1.01; 


T(7Li) = 1.P(8Be*) ~ 1. 
“Mp = 1.01 x 1.00782 = 1.0179; m(7Li) 
= M =7.016004; m(8Be*) = gi“ = 8.02662. (4.152) 
Pp + Pui 2.417 x 107 +0 


Mp +myz; 1.016004 + 7.01604 


*. Ucm,before = 


G 
= 0. 10° Si 
0.3107 m/s~ 
P c 
Vem,after = a fo Too’ See Eq. (4.147). 


Answer to Part (c). This is too trivial. The momentum of the parent 
nucleus is zero before the decay. The momenta of the daughter a-particles 
are equal and opposite. Hence P = 0 before and after the decay process. 
Hence, Vem is zero always. 


4.12. I[llustrative Numerical Examples II 


Example 4.1. We observe two galaxies A and B moving in opposite direc- 
tions with speeds 0.5c and 0.4c, respectively. What is the velocity of B as 
seen from galaxy A? 


Solution. The specified velocities v4 = —0.5c, ug = 0.4c are with respect 
to “our” frame S. Let S’ be the frame of the galaxy A. Then the required 
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velocity vu’, of B is obtained by applying the velocity addition formula (4.6). 


; 0.4 — (—0.5) 
UB SE SSS 

1 — (0.4)(—0.5) 
Example 4.2. In the Lab frame a 7* is moving in the positive X-direction 
with velocity 0.8c and a m~ is moving in the negative X-direction with 
velocity 0.9c. Find (a) the velocity of 7~ in the rest frame of 7*, (b) the 
velocity of 7+ in the rest frame of 77. 


Solution. This is similar to the previous problem. The given velocities 
Un+ = 0.8¢, v;- = —0.9c are with respect to “our” frame S$. Let us repre- 
sent the rest frames of t+ and m~ as S$’, and $”, respectively. We get the 
answers as follows. 


; “09-08 


(a) vu, = T=(-0.5)(0.8)° = —0.988c. 
ee en 
(b) uy = T—(8)(20.5)° = 0.988c. 


Example 4.3. A rod of proper length Zp = 2m is moving in the +X- 
direction with velocity 0.6c. An electron is moving in the —X-direction 
with velocity 0.8c. Find the time T that the electron takes to traverse the 
length of the rod, as seen from the Lab frame S$? Answer the question in 
the following three ways and see that the answers agree. 


(a) View the motion from the frame S. 

(b) View the motion from the electron’s rest frame F, and convert the time 
thus obtained into Lab time T. (Hint: use velocity addition formula 
and Rules 2 and 3) 

(c) View the motion from the rest frame of the meter-stick and convert 
back to Lab time 7. (Hint: Use velocity addition formula and LT.) 


Solution. (a) Let vap be the velocity with which the rod and the electron 
are approaching each other, as seen from the Lab frame S, and let D be 
the length of the rod in frame S which is related to the proper length 
Lo by the length contraction formula (2.26). We need to calculate the 
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Lorentz factor y for transformation from the Lab frame to the rest frame of 
the rod. 


Vap = 0.6c + 0.8¢ = 1.4, 
— 1 _ — 
US /t=0e 08 


p279 96982 1.6m, 


5 
4’ 


1.6 1.6 _8 

oor ide L2K8 1 = 0.38 x 107° s. 
(b) We shall first find the time 7 of traversing the length of the rod in the 
electron’s rest frame E. For this we need to know the velocity c6’ and the 
length ¢’ of the rod in the frame E. We apply the velocity addition formula 
to get the first answer, and the length contraction formula (2.26) to get the 
second answer. For this second answer, we need the Lorentz factor 7’ for 
transformation from the frame F to the rest frame of the rod: 


0.6 — (0.8) 1 
fc ee = pa Al 

P = 7 (6)(—0.8) oT = A=oo 

te 2 
eet el | 

7 3.04 mre 

L/ 

Oboe = 0.23 x 1078s. 


~ Ble 0.946 x 3 x 108 


We shall now convert this proper time into the Lab time 7’ using the time 
dilation formula (2.23). For this, we need the Lorentz factor y. for trans- 
formation between the Lab frame S and the electron frame E: 


1 1 i) 


ee Vi-082 06° 3 
T= = x 0.23 x 1078 = 0.38 x 1078s. 
(c) This is left as an exercise for the reader. 


Example 4.4. A proton is moving in the X-direction with velocity 0.999c. 
The rest mass of a proton is m, = 1.0078 u, lu= 1.66 x 107?’ kg. Find 
(a) the energy equivalent of 1 u, denoted as €, (b) the energy equivalent 
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of the rest mass of the proton, to be denoted as Ep, (c) the Lorentz fac- 
tor [ for the moving proton, (d) the momentum p of the moving pro- 
ton, (e) the energy £ of the moving proton. Express energy in MeV, and 
momentum in MeV/c. Use the conversion factor 1 J = 6.242 x 101? MeV. 


Solution. Let us denote 1 u as p. Then mp = 1.0078. 
(a) € = pc? = 1.66 x 10-2” x (3 x 108)? = 14.94 x 10-' J 
= 14.94 x 10-1" x 6.242 x 1012 MeV = 932.56 MeV, 
(b) Eo = mpc? = 1.0078 € = 939.83 MeV, 
1 


c) T=————— = 22.36, 
(©) V1 — 0.9992 
B 


d) p=Im,v =ImpeB =Tmyc? x &. 
Pp P Pp c 


= 22.36 x 939.83 x 0.999 MeV/c = 20970 MeV/c, 
(c) BE =Tmyc? = 22.36 x 939.83 = 20991 MeV. 


4.13. Exercises for the Reader I 


Listed below are values of some physical constants which the reader may 
need in working out some of the exercises. (The mass values are taken 
from [16].) 


c=3x 10°km/sec., h = Planck’s constant = 6.63 x 10734 J.sec. 


Masses of e~ (i.e. electron), p (i.e. proton), n (i.e. neutron), a (i.e. alpha- 
particle), 7Li nucleus are to be taken as me = 5.491074 u = 0.511 MeV/c’, 
Mp = 1.0078u, my, = 1.0087u, ma = 4.0026u; m (7Li) = 7.0160u; where 
“yu” means “atomic mass unit (1?C scale)”. lu = 931.48 MeV/c?. leV = 
1.6021 x 10719 J. 


R1 We observe two galaxies A and B moving in opposite directions with 
speeds 0.5c and 0.4c, respectively. What is the velocity of B as seen from 
galaxy A? 


R2 In the Lab frame a m* is moving in the positive X-direction with 
velocity 0.8c and a m~ is moving in the negative X-direction with velocity 
0.9c. Find (a) the velocity of m~ in the rest frame of zt, (b) the velocity 
of zt in the rest frame of 77. 


R3 A meter-stick (proper length 1m) is moving in the +X-direction with 
velocity 0.6c. An electron is moving in the —X-direction with velocity 0.8c. 
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How long does the electron take to pass the meter stick, as measured in the 
Lab frame L? Answer the question in the following three ways and see that 
the answers agree. 


(a) View the motion from the frame L. 

(b) View the motion from the electron’s rest frame E, and convert the time 
thus obtained into Lab time. (Hint: use velocity addition formula and 
Rules 2 and 3.) 

(c) View the motion from the rest frame of the meter-stick and convert 
back to Lab time. (Hint: Use velocity addition formula and LT.) 


R4 Two straight rods R, and Rz of proper lengths ZL; = 15m and Lz 
20m are moving in opposite directions with velocities 6 = 3 and 8 = 
respectively as seen from the Lab frame. Find the time required by the rods 
to traverse the length of each other. Answer the question in three different 
ways and see that you get the same answer. 


ous || 


(a) View the motion from the Lab frame L. 

(b) View the motion from the rest frame of R; and convert the time thus 
obtained to Lab time. 

(c) View the motion from the rest frame of Rz and convert the time thus 
obtained to Lab time. 


R5 How much weight is gained or lost when 1 tonne ice melts? Take latent 
heat of fusion of water as 3.34 x 10° J.kg™t. 


R6 A proton is moving in the X-direction with velocity 0.999c in the 
Lab frame. (a) Find the energy and momentum of the particle in the Lab 
frame. (b) Using energy-momentum transformation equations (4.83) deter- 
mine the energy and momentum of the particle in the frame S’ under the 
boost: Lab(0.990, 0, 0).S”. 


R7 Answer the same questions, asked in Problem 4.6, for an electron of 
the same energy. 


R8 Answer the same questions, asked in Problem 4.6, for an a particle of 
the same energy. 


R9 Find the momenta and velocities of the following particles. (a) 1 MeV 
electron, (b) 1 MeV proton, (c) 1MeV a particle, (d) 1GeV electron, 
(e) 1 GeV proton, (f) 1GeV a particle, 
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R10 A hydrogen atom decays from the level n = 2 ton = 1. The energy 
levels of the hydrogen atom is given by the formula E,, = oe eV. The 
rest masses of a proton and an electron are m, = 938.3MeV and me = 
0.511 MeV. Using formulas (4.136) and (4.137) determine 


(a) the momentum of the recoiling atom, 
(b) the energy of the emitted photon. 


R11 “Li nucleus consists of 7 nucleons of which three are protons and 
four are neutrons. Determine the energy in MeV required to dissociate 
this nucleus into its seven constituent nucleons. This energy is called the 
binding energy of the nucleus. 


Answers to Selected Exercises 
R10.75c. R2-—0.998c. R30.19x10-§s. R4120/7c. R5 0.37 x 
10-* ei, 


R6(a) E = 20,991MeV; pe = 20,970MeV. R6(b) BE’ = 1631 MeV; 
p'c = 1335 MeV. 


Part II 
Amazing Power of Tensors 
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Chapter 5 


Let Us Know Tensors 


We shall now make a preparation to launch a vehicle that will take us from 
our mundane three-dimensional physical space, namely the Euclidean space, 
and denoted by the symbol E%, which is encompassed by the three Cartesian 
axes X,Y, Z, or better still, spanned by the three unit space vectors i, j,k, to 
a more esoteric and adventurous world of four-dimensions with the addition 
of just one more axis, the time axis cI’. This new four-dimensional world 
will be called Minkowski space-time, and denoted by the symbol M*. At 
the heart of this mathematical construct is Minkowski’s assertion (which 
we are restating in a modern style) that any phenomenon in physics must 
be expressible in the form of a tensor equation in which both sides must 
be a tensor of the same rank, and the same sequence of contravariant and 
covariant indices. Such a statement, without proper clarification, will scare 
the reader, as they have done to umpteen ordinary persons, who have been 
fed with stories and myths of a Relativity demon, living in a lofty mountain, 
beyond the range of ordinary vision. 

Our objective is to demystify this demon. Nothing that Einstein or 
Minkowski said can be above the level of a student who has studied calculus 
and the basic principles of electromagnetism. In fact, the basic impulse 
behind Einstein’s construction of Special Relativity, his puzzle and how he 
overcame it, were worked out in a series of exercises, all of which can be a 
standard set of homework problems in a serious course in Electromagnetism 
at the undergraduate level. 

We have presented an exposition of the original papers of Einstein and 
Minkowsi in two articles which can be downloaded from the website of this 
author (see Preface). 
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In this part of this book, we shall make a special effort to explain what 
is a tensor [17]. We encounter this strange object as stress tensor in engi- 
neering mechanics (sometimes without being aware of it). We shall find 
out how the same object makes a rebirth in M* as a 4-tensor. It is our 
hope that the step-by-step approach we have undertaken in the following 
sections will equip the reader with the necessary gear for climbing the Rel- 
ativity mountain. 


5.1. Introduction to Tensor 
5.1.1. Vector—tensor analogy 


It may help the reader get an intuitive impression of a tensor if we tell 
him what is common between a vector and a tensor. In fact going by the 
gradation of tensor, a scalar quantity, like the electric charge of a particle 
(e.g., an electron), is a tensor of rank 0. A vector quantity like the momen- 
tum of a particle is a tensor of rank 1. The inertia tensor of a rigid body, 
from which its moment of inertia about any axis can be obtained is a tensor 
of rank 2. 

In this chapter, we shall use the term tensor to mean a tensor of rank 2, 
even though one can build a tensor of arbitrary rank n > 2, in which we 
are not interested. 

In Fig. 5.1(b) we have drawn a vector V, by which we mean a directed 
straight line segment of a measured length. We have chosen a set of 
Cartesian axes X,Y, Z, and projected the vector on these axes, by dropping 
perpendicular straight lines on these axes, with intercepts V,,V,,Vz. We 
call these intercepts the scalar-components of V associated with the respec- 
tive axes, their directions represented by the unit vectors (also called the 
base vectors) e,,e,,e,. We can then write V either as a column matrix, 
or as a linear superposition of the base vectors with (Vz,Vy,Vz) as the 
coefficients. 


V=| Vy | = Vrez + Vyey + Vzez. (5.1) 


We can project V on any arbitrary direction represented by the unit 
vector n and get the intercept V,, and call it the scalar component of V in 
the direction of n. Mathematically, we obtain V, by taking a dot product 
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AY 


e e 
_ Real line 0. Real line + 


(a) 


Fig. 5.1. Scalar, vector and tensor as geometrical objects. 


of V with n. Let n =n ,e; + nyey +nze,. Then 
Va 
V,=V-n=| Vy | (me ny nz) = Vong + Vyny + Vine. (5.2) 
V, 
By analogy, we can think of a mathematical object T having vector- 


components T,,Ty,Tz associated with the axes X,Y,Z, as shown in 
Fig. 5.1(c), and call it a tensor. We can then write 


T; 
T = |W, | = T6286, + Pee: (5.3) 
T, 
We can take a dot product of T with n, and call it the vector-component 


of T associated with the direction n = Nzlx + Nyey +nzez, and represent 
it by T,,: 


T 
c= Ten= Ty | Ge ty te) = Tene + Tyitty + Tig. (5.4) 
T 


z 
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Note the most distinguishing features of a vector and a tensor. The 
scalar components (V;, V,, Vz) uniquely determine the vector V. The vector 
components (T,,T,,T~) uniquely determine the tensor Lr 

Scalars, vectors, tensors are all geometrical objects* illustrated in 
Figs. 5.1(a)-5.1(c). They can be represented, respectively, as a point (on 
the real line), a measured and directed straight line V, as a triplet of straight 
lines (Tz, Ty, T:). 

We shall now take a rigorous look at tensor, using a language which can 
be somewhat abstract. 


5.1.2. Linear operator in a vector space 


We shall begin by explaining what we mean by linear operator in a vector 


space. 
By the three-dimensional linear vector space Y we mean the set of all 
vectors A, B,C,... we can think of and all such vectors we can construct 


by combining them linearly, e.g. 7A + AB where 7, A are real numbers. 

Let us think of two vectors C and D having Cartesian components 
(C,, Cy, Cz) and (Dz, Dy, D-) and related to each other in such a way that 
the values of the former determine the values of the latter. This means 
that C is an independent vector and D is a dependent one. In other words, 
D is a function of C. Let us further assume that D is proportional to C. 
That is, if for example we double C, then D is doubled. These two vectors, 
however, may or may not be in the same direction. In that case, we say 
that a linear operator O transforms C into D. We may like to write this 
transformation symbolically as 


O(C) =D. (5.5) 
The property of linearity means that 
If O(C) =D and O(E) =F, then O(aC + bE) =aD+bF, (5.6) 


where a,b are two arbitrary scalar constants. 

In Fig. 5.2 we have shown two simple examples of how the operation 
O can take place. In Fig. 5.2(a), we have shown a particle of constant 
mass m in arbitrary motion along some trajectory [. At some instant of 
time t it has velocity v. Therefore, its momentum at the same instant is 
p = mv. We can therefore think of the operator O transforming velocity 


4 As stressed by Misner et al. [5, Sec. 2.2]. 
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_ 
(0) 


Us 


(b) 


(c) 


Fig. 5.2. Two examples of how a linear operator O transforms a vector into another 
vector: (a) O acting on v yields p; (b,c) O acting on w yields L. 


v into momentum p by scaling the length of the former by the factor m 
without changing its direction. 

In Fig. 5.2(b), we have shown a rigid body rotating about some axis 
pointing in the direction of the unit vector n with angular speed w, so 
that its angular velocity is w = wn. Its angular momentum is L, which 
(in general) does not coincide with the direction of w. In this case, the 
operator © transforms the angular velocity w into angular momentum L 
by changing the length as well as the direction. The linear operator O in 
this case is the inertia tensor Z about which we shall give some more insight 
in Sec. 5.1.5. 

For our immediate purpose, we shall look upon a tensor T as a linear 
operator. The linear operation mentioned above suggests that T can be 
represented by a matrix, and the “tensor operation” can be represented as 
a matrix multiplication. This will become evident in the next section. 


5.1.3. Tensor as a dyadic 


Two arbitrary vectors A, B can be combined in three types of “multiplica- 
tion operation”, the first two of which the reader is familiar with, namely, 
(1) the dot product A -B which is a scalar; (2) the cross product A x B 
which is a vector. Now comes (3) the third type, namely the dyadic product 
AB, which is a simple juxtaposition of the vectors, without any dot or cross 
in between, which we shall call a dyad. 


bSee a good book on intermediate mechanics, e.g. Ref. [18, Chapter 10], which also 
discusses Inertia Tensor in detail. 
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We define the dyad AB to be a linear operator which converts any 
vector C to another vector D and this conversion can be done in either of 
the following two ways: 


operating on the right: AB-C def A(B-C)=7A 

where 7 = B- C = scalar; (5.7a) 
operating on the left: C: AB“ (C- A)B=B 

where \ = C- A = scalar. (5.7b) 


The linearity property follows from the operation defined in (5.7). Also 
note that in general, AB 4 BA. 

We shall write the sum of two dyads AB and EF as AB + EF and 
define it by the distributive property: 


(AB+EF)-C@ AB.C+EF-C=A(B-C)+E(F-C), 


C-(AB+EF) @ C-AB+C-EF =(C-A)B+(C-E)F. 


(5.8) 


It should be a simple exercise to show from Eq. (5.7) that the dyadic 
product is distributive, i.e. if E, F,C are three arbitrary vectors, then 


(E+ F)C = EC+ FC, 
(5.9) 
C(E+F) = CE+ CF. 
As a corollary, 
(A+ B)(E+F) = AE+ AF + BE+ BF. (5.10) 


A sum of dyads can be called a dyadic. We shall prefer to use the term 
“dyadic” as a general name for sums of dyads as well as individual dyads. 

We shall frequently use the symbols e;,e,,e, to represent unit vectors 
in the directions of the X-, Y-, Z-axes, for which we had used i, j, k earlier in 
this chapter. As we progress, we shall use another set of symbols e1, €2, e3 
to mean the same unit vectors. This transition (i,j,k) - (ez,e,,e:) 
(e1, €2,€3), side by side with (x,y, z) > (#1, 22,23) will restore symmetry 
and help us use Einstein’s summation convention (following Eq. (5.16). 

The unit vectors (e1,e2,e3), in both Cartesian and spherical coordi- 
nate systems, form an orthogonal right handed triple and this property is 
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expressed as 
@, -€2 = €2:e€3 = €3:° €j = 0, 
e, ee; =e2:-e2 = e€3:e3 = 1, (5.11) 
e; X €g = €3; €2 X €3 = €1; €3 X Cy = €g; 
or, more compactly as 
ej, e; = big, 
(5.12) 


ej xX Cj = Cijkek, 


where 6;;, called Kronecker delta, and €;;,, called Levi-Civita Symbol, are 


defined as 

a a, (i atta, 
di = OY =O = et (5.13) 

0, ifiA7. 

1 if ijk = 123, 231,312, 
cigk = & —1 if if = 213, 321, 132, (5.14) 
0 ifi= 7, orj =k, or, k =i. 

Let us now consider the set of 12 dyads: {ezez, rey, Crez,...,e2€:}. 


Using them we can construct the following dyadic 


T = Trr@r€x P Tyx€yex a Pe Tyz@y€z + Tz2€z€z 
3.3 
= pee Tj @ie; = Tij@ie;; (5.15) 
i=1 j=1 
where the subscripts (1, 2,3) represent (x,y, z), respectively. That is 
Qj Sey, CQ = ey; €3 = Cz; 


and, Ti, =Tie;} The =Try3--: 
132 = Ty; 133 = Tez 


: (5.16) 


are arbitrary real numbers. 

In the second line of Eq. (5.15), we have introduced Einstein’s summa- 
tion convention: sum over repeated index, without explicitly inserting the 
sum symbol 5°. The subscript “;” appears twice, implying a sum over i. 
The subscript “;” appears twice, implying one more sum, this time over j. 
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The mathematical object T appearing in Eq. (5.15) is what we 
shall call a tensor for all purposes in this book. The set of dyads 
{€r€xr, rey, Crez,...,e2e,} can be looked upon as a complete set of base 
dyads forming a basis B in the tensor space T of T. This is analogous to 
the way that the vectors {e,, e,, e:} form a basis B in the vector space ) 
of V. Any arbitrary vector V can be written as a linear superposition of 
the base vectors as 


V = Vries + Vyey + Vez, (5.17a) 
where V; = V- ez, Vy = V- ey, Vz = V- ez, (5.17b) 


are the Cartesian (scalar) components of V in the basis B. In the same way 
any arbitrary tensor T can be written as a linear superposition of the base 
dyads, as in Eq. (5.15), where the nine quantities {T2, Try,...Tzy, Tz} are 
to be interpreted as the Cartesian (scalar) components of T with respect 
to this basis B. 

From the definition of dyad given in (5.7), and the orthogonality of the 
base vectors {e,,e,,e-}, ie. 


€; en = Oj, Jk=1,2,3=2,y,2, (5.18) 


it should be apparent that the base dyads operating on any arbitrary vector 
V will yield the following vectors: 


€r@r*V =ezVz; Cxey: V=ezVy; °° 5 C2ey V = ez Vy; 
(5.19) 
Vere; = Vrez; V-erey = Vrey;:-: ; V- ezey = Vzey. 


Hence, if A = A,e, + Ayez + Aze, and B = B,e; + Bye, + Bzez are two 


arbitrary vectors, then, A-T-B& A. (T-B) = A;7j;B; =(A-T)-B. 


n~ 


Special case: e; - T- e; = Tj. (5.20b) 


If the nine components {7;,;} of a tensor T are given, the tensor can 
be constructed using Eq. (5.15). Conversely, if a tensor T is given in the 
form of a mathematical relation, its nine components T;; can be retrieved 
by means of Eq. (5.20b). 


Let Us Know Tensors 149 


Using the distributive property given in (5.10) it is seen that the dyadic 
product of A and B has the following dyadic representation: 


AB = A, By ezez + A, Byezey + ++: + A, Byeze, + A,B.e-e, 
= A; Bjeie;. (5.21) 
Hence, if we write 
T=AB, then Ti; = A;B;. (5.22) 


Using Eq. (5.7a), the operation of the tensor T on the vector C = Cher 
placed on the right works out as follows: 


T -C= (Tije:e;) (Crex) 
= Ti; Chex (e; : ex) 


We have used the orthogonality relation (5.18) to get to the last line. 
In a similar way, using Eq. (5.7b), the operation of the tensor T on the 
vector C = Cex placed on the left works out as follows. 


C : T = (Cyex) : (Tijeie;) 
= CyTi; (Cx ¢ e;)e; 
= (Cy. Tr; )e;- (5.24) 


The above two equations suggest that if we write D = T-CandF=C. T, 
then the Cartesian components (D1, Dz, D3) of D and (Ff, Fb, F3) of F can 
be obtained from matrix multiplications: 


dD, Ti1 Tig Tis C1 
Dz | = | Tor Tho To3 Co |, (5.25a) 
Ds T3i T32 Ts Cs 
Ti Tig Tis 
(Fi Fo F3)=(Ci Cy C3) ] Ta Teo Ts |. (5.25b) 
31 T32 Ta 


In the above equations, starting from Eq. (5.7), we have used a dot (-) 
to separate the tensor from the vector on which it is operating. We shall 
frequently refer to a tensor operation as a dot product between the tensor 
and the vector. Equations (5.25a) and (5.25b) show that a dot product 
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actually involves a matriz multiplication. A tensor is to be represented as 
a square matrix, and a vector either as a column matrix or a row matriz, 
depending on whether the tensor operation is on the right or on the left.° 


Ty Tho Ti3 Cy 
T= | Ta Too The | =[T], C=] Co | ={C}, 
T3; 32 33 C3 (eZ) 


F=(7, & F) =(?): 


In the above equations, we have adopted the convention of indicating a 
3 x 3 square matrix by [], a 3 x 1 column matrix by {}, and a 1 x 3 row 
matrix by (). Hence, Eqs. (5.25a) and (5.25b) can be written as 


{D}=[T]H{C}, ()=(C)7}. (5.27) 
It follows from Eq. (5.21) that the matrix representation of the dyadic 
AB is 
AiB, Ai By, ABs 
AB= AoB, Ao Bo Ao B3 ‘ (5.28) 
A3B, A3Bz A3B3 


We shall define the dot product of two tensors S and T as the tensor 
R =S.‘T by its operation on an arbitrary vector C on the right in the 
following way: 


(S-T)-C=8.(T-C). (5.29) 


From this, it follows that the matrix representing R is given by the product 
of the matrices representing S and T. That is, 


[R] = [S][T], implying: Rij = SixTh;- (5.30) 


It is then obvious that, in general, S-T x T<s. 


“In Quantum Mechanics (QM), a clear distinction is made between a vector A on left 
and a vector B on right, as in the scalar product A.B. The former is called a bra 
vector and the latter a ket vector, and together, in the scalar product, they constitute 
a bra-ket: A — (A|; B > |B); A-B => (A|B). However, these vectors are in general 
infinite dimensional, their components are complex numbers, and the components of the 
bra vector (A| are complex conjugates of the respective components of the ket vector | A). 
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Using the matrix representation as given in Eq. (5.30), and the tensor 
operation on the left as found out in (5.24), we can now see how the product 
tensor R = §-T will act on the left. 


C-R= (Cyftng ej = (CeStem I mye; 
= (CySkm)( Tmje;), (5.31) 
or C-(§-T)=(C-S)-T 


We can extend the definition of matrix product to any number of 
tensors, by writing the matrix representation of the product tensor as 
the product of the representative matrices of the component tensors. 
For example, 


ifR=A-B.-C, then [R] = [A] [B][C]. (5.32) 


At this point we shall add a word of caution. A tensor is not the same 
as a square matrix, just as a vector is not the same as a column matrix 
or a row matrix. The row matrix shown in Eq. (5.26), for example, gives 
the components of the vector F in a given coordinate system XYZ. As the 
coordinates are changed from (x,y,z) to (a’,y’,2’), the components will 
transform from (F)\, F2,F3) to (Fi, F, F5). However, the vector F itself 
is a “geometrical object” (a straight line of measured length pointing in 
an assigned direction) which remains invariant under all coordinate trans- 
formations. In the same way, the tensor Tisa geometrical object, which 
remains invariant under all coordinate transformations, even though its 
components will change from the square matrix [T;;] to another square 
matrix [T/;] under the same coordinate transformation. 

Yes, the components of all tensors will transform, except the compo- 
nents of the identity tensor which we shall introduce in the next section. 
They will remain the same, the same as in (5.34), following any coordinate 
transformations. 


5.1.4. Identity tensor, completeness relation, components 
of a tensor in the spherical coordinate system 


In matrix multiplication one needs the identity matrix 1 which in the 
present context, is the matrix representation of the identity tensor, also 
known by the alternative name idemfactor. It will be recognized by the 
symbol 1. Its sole property is that when it operates on any vector V, either 
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on the right, or on the left, it gives back the same vector. 

i-v@v, v-1#v. (5.33) 
Such a tensor must have 1 for its matrix representation. The dyadic repre- 
sentation (shown below) follows from the above property and the orthogo- 
nality relation (5.18). 


10 0 
1=1=/0 1 oO], (5.34a) 
001 


i=e.e,+ eye, +e, = e;e;. (5.34b) 


Equation (5.34a) gives the Matrix representation, and Eq. (5.34b) the 
dyadic representation. 

It will be advantageous to write the tensor T in a curvilinear coordinate 
system, in particular, spherical coordinate system. For this purpose, we 
shall write down the transformation equations for the coordinates and the 
base vectors: 


x=rsin@cosd, [0<r < od, 
y=rsindsing, [O<0<n], (5.35) 
z=rcos0, [O<¢< 2z]. 


é 


In the above equations, we have indicated the 
dinates within the [ ] brackets. 


‘ranges” of the three coor- 


e, = sin O(cos ge; + sin de,) + cos Jez, 
eg = cos (cos ge, + sin de,) — sin de,, (5.36) 
eg = — singe, + cos dey. 


Using these equations (and remembering that e,eg 4 ege,, for exam- 
ple), it should be a simple exercise to show that 


e,e, + egeg + egeg = ere, + eye, + e,e, = 1. (5.37) 


If we have three unit vectors {a, b,c} which are mutually orthogonal at 
every point in space and such that 


aa + bb+cc=1, (5.38) 


then we say that these three vectors form a complete orthogonal set, and 
hence a basis, so that any arbitrary vector V can be represented as a 
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linear superposition of these three vectors.4 This should be clear from the 
following: 


V=V-1=V.(aa+ bb+cc) = V,a+ b+ Ke, (5.9) 
5.39 
where V,=V-a,Vy,=V-b, V.=V-c, 


are the components of V in the directions of {a, b,c}, respectively. Using 
the completeness property, it can be advantageous to write a tensor in the 
following style. 


T=1-T.1= (aa + bb +cc)-T- (aa + bb + cc) 
=T,,aa+ T,,ab+ T,,ac+::-+T.,cb+T..ce, where (5.40) 
Tn =a-T-a, Ty =a-T-b,. Ty =e¢-T-b, T.=e-T-c 
are the components of T with respect to the basis {a, b, c}. 


We shall illustrate the operation shown in Eq. (5.40) by writing the 
tensor T in Cartesian and spherical coordinate systems: 


f= (e,€, + eyey + €ze,) - ls (e,e€, + eyey + €2e2) 
= Traxx + Tryerey + Trzexez +-+-+Tge,ey +Tz,e2e2, (5.41a) 
Tiss: =e,:T és, Tape Ope TSigice, Tey =e,-T-e,, (5.41b) 
Tz =ez° Ts e., (5.41c) 

T= (e-e, + egeg + egeg) - 7, : (e-e, + egeg + egeg) 
= T,,e,-e, + T,ge-eg + T,gereg +--+: + Typegeo + Topeges, (5.41d) 


T., =e, Te, Tro = Cr T-9,...,Ty9 = €g-T + 0, (5.41e) 


=) 


Toe = e¢° eg. (5.41f) 
Equations (5.41a)—(5.41c) represent the tensor T in a Cartesian coordi- 
nate system, and Eqs. (5.41d)—(5.41f) in a spherical coordinate system. 


4In QM, the completeness of a set of orthonormal vectors {|u;); i = 1,2,...,00} is 
expressed through the statement >7, |i) (ui] = = 1. This relation is used to change the 
representation of a Hermitean operator T, the equivalent of the tensors we are considering 
here. The (i,j) component of T will then be written as fe (us|Tlu;), the equivalent 
of Eq. (5.20b). 
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We can then write the components of T in the following matrix forms: 


Tra Ley Tez Tyr To Trg 
eo |e Ty De ly PS | ys Be Bale Gar 
Te Tey Tez Tor To To 


The first matrix gives the Cartesian components, and the second one the 
spherical components. 

Using the transformation of the base vectors (5.36), and the complete- 
ness relations (5.37), one can transform the Cartesian components to spher- 
ical components, for both vectors and tensors, as we shall show. For this 
purpose, we shall temporarily denote the spherical base vectors with a 
prime, i.e. {e; : i =r,0,¢} and make a table of transformation coefficients 


{cig}: 
“ @j€; = Cij@j, 
where cj; =e,-e;: i=7,0,¢; fj =2,y,2. 


, ee (5.43) 
sin@cos@ sin@sing  cosé 


=| cos@cos@ cos@sing —sind 
—sing cos 0 


Now, let V be a vector and T be a tensor with Cartesian components 
{Vj}, {Ziz}, 2,79 = x,y,z], respectively. Then the spherical components 
r / 7 a _ 
of the same vector and tensor, namely, [{V;}, {Tj}, i,7 = 7, 0,4] will be 
obtained in the following ways: 


Vj = V-e) = V- exe, -e; = cnVi, (5.44a) 


Tj; =e;:T-e; =e;-e,e,-T- ee, - ej 
= Cincy Trt. (5.44b) 


Note that we have used the summation convention: sum over & in (5.44a), 
sum over k,l in (5.44b). 


“In Tensor analysis, the primary language of the theory of relativity, the rule of transfor- 
mation has different forms for contravariant and covariant vectors, and for contravariant, 
covariant and mixed tensors. The rules we are establishing here are different from them. 
The components of vectors, tensors we are using may be called physical components, 
in contrast to their contravariant and covariant components for which a more elegant 
transformation rule is used. 
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We shall illustrate the transformation formulas (5.44) with two exam- 
ples, i.e., V, = Vi and Tyg = Tip. 


V, = sin@ cos @V; + sin 6 sin @V, + cos OVz, 
T,9 = sin 6 cos ¢(cos 0 cos Tre + cos 6 sin ¢Tyy — sin OT;z) 
+ sin 0 sin ¢(cos 6 cos Ty, + cos @ sin Ty, — sin OTyz) 


+ cos O(cos 6 cos Tz, + cos @ sin OT, — sin OT-,). (5.45) 


5.1.5. Example: Inertia tensor 


We shall illustrate the tensor concept by showing two important examples, 
namely (1) the inertia tensor and (2) the stress tensor. We shall take up 
a short discussion of the first example in this section leaving the second 
example, which needs a more detailed coverage, to the next section. 

In Sec. 5.1.2 we talked about the tensor operation converting the angular 
velocity w into angular momentum L. The corresponding operator is the 
inertia tensor Z of the rigid body. Its dot product with the angular velocity 
w gives the angular momentum L of the rigid body. That is, 


L=Z-w. (5.46) 


We shall find an expression for the vector angular momentum L of a 
rigid body which is rotating about a point O (which can be a moving point, 
e.g. the CM) with angular velocity w = wn about the axis pointing in the 
direction of the unit vector n. Let 7 be one of the constituent particles, 
having mass m,;, and located at the radius vector r; with respect to O, as 
shown in Fig. 5.2(c). The velocity of this point is v; = w x r;. Therefore, 
this particle has an angular momentum with respect to the point O, equal to 


£; =7; x pj =7%; x MiVi = MNjV 5 x (w x it) = my (rsw = (r; -w)ry]. 
(5.47) 


Assuming that the rigid body is made of N particles (which is a very large 
number), we add the angular momentum of each particle to obtain the 
angular momentum of the rigid body about the point O, given as 


N 
Lo =) > mi[rjw — (rj -w)rj). (5.48) 
j=0 


We can write the quantity within square brackets as 


[r2w — (rj -w)rj] = [r71 —rjrj]-w, (5.49) 
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and construct the inertia tensor as the dyadic (sum of infinitely small dyads) 
_ NN 
T=) mr? — 1,73). (5.50) 
j=0 


Then, we get the angular momentum as the dot product 


Lo =Z-w. (5.51) 
We have thus derived Eq. (5.46), and along with it have found an expression 
for the inertia tensor in Eq. (5.50). Note that the expression within the 
square brackets is the difference of two dyadics, namely, the identity dyadic 
| multiplied by the scalar ae and the dyadic product of r; with itself. 
For further clarification we shall write down the components of the ten- 
sor. Assuming that the rigid body has uniform mass density p distributed 
over its volume V, the sum in Eq. (5.50) becomes the integral: 


T=p / | I [r21 — rr]d?r. (5.52) 


Some of its components are 


te off fe —2*|@r= eff{fw +2) e'r; 


(5.53) 
Loy = =p II (xy) d°r; ete. 
Vv 
It is now seen that the inertia tensor is a symmetric tensor, i.e. 
Loy =Lyn3 Dyn =Layj Lee = Lez: (5.54) 


This symmetry property is preserved under all coordinate transformations. 


5.2. Stress in a Medium 
5.2.1. Stress vector 


By (mechanical) stress we mean internal forces (in the form of intermolec- 
ular forces) called into play when bulk matter, either in the form of solid, 
liquid or gas, is subjected to external forces. These internal forces exist 
throughout the bulk matter and its mathematical expression is given by a 
stress tensor field T(z, y, 2). 

For simplicity we shall consider a solid block in Fig. 5.3(a). It has been 
cut into two parts, the upper block YU and the lower block £, by an imaginary 
plane %, leaving a trace T' of its boundary. This plane is identified by the 
unit normal vector n pointing from the lower block to the upper block. 
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(d) 


Fig. 5.3. Explaining the stress tensor. 


In Fig. 5.3(b), we have shown the lower block £ with the plane of sep- 
aration © exposed. Let us consider a small area da at the point P(z, y, z) 
inside the solid, but lying on this plane. Then the stress vector FT" (x, Y,2) 
is defined to be the force per unit area at P(x, y,z), exerted by the atoms 
of the upper block Y/ on the atoms of the lower block £ across the plane n. 
The infinitesimal force acting on the area da is then 


dE = T(x, y, ) da. (5.55) 


Note that in general the direction of the stress vector_ T (a,y,z) is 
different from the direction of the normal n. If, however, 7 (x,y,z) || n 
(ie. perpendicular to the plane), the stress (vector) is called normal stress. 
If 7 (x,y,z) Ln (ie. parallel to the plane), it is called shear stress. 


5.2.2. Stress tensor 


Let us go back to the matrix representation of T given in Eq. (5.42), and 
be specific that we are considering only the Cartesian components. Take 
the dot product of T with e,, using the dyadic form (5.15), and call it 
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the vector component of T associated with the X -direction, and write it 
as T®): 
T@) = se ‘Qe; = Teej ei “ey = Tye; = Tre@e + Tyeey + Treez. (5.56) 
Comparing with (5.42) we notice that the columns 1, 2, 3 of the 


Cartesian matrix T represent three vectors T), T®,T©, respectively, 
each as a column matrix. 


Dig yy te 
T=| Tye Ty Tye | =(T® T® T), (5.57) 
Tee Try Tez 
Tex Tey Tez 
Ne Ts | as. es (5.58) 
Te. am Tez 


Now let n = nzey + Nyey + nye, be a unit vector, representing some 
direction in space. Let us construct the dot product of T with n 


TT =T-n. (5.59) 


We shall call T™ the vector component of the tensor T associated with 
the direction n. 

The above dot product operation can be represented as the matrix mul- 
tiplication: 


io i, ty EA fa 
T™ = is =| Tye Ty Ty My |: (5.60) 
Ti”) sae Toy Tez Nz 
or, more compactly as 
Ne 
Te = (To) TY sla ny, | = T@) n+ TOn, + TR, (5.61) 
Nz 


Let us specialize T to stress tensor 7. 


T) TY) Te 


ab) 
I 
2 


1 
y Tez |- (5.62) 
Ty 
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Note from the above equation that in Jj; the second index j is the “surface 
index” (indicating the direction of the surface on which stands the stress 
vector TY )) and the first index 7 the “component index” (indicating 2, y, z 
components of T”), 

Now rewrite Eq. (5.61) as 


T® = TOng + Tony + TO ni. (5.63) 


The above equality involving the stress components can be proved using 
Newton’s second law of motion applied to a fluid in motion or a solid under 
deformation. In other words, the stress vectors on three perpendicular sur- 
faces determine the stress on any other surface pointing in any arbitrary 
direction. Therefore, stress J is a tensor as per the qualification written in 
Sec. 5.1.1 

In Fig. 5.3(e) we have shown the upper part of the solid of Fig. 5.3(a), 
and the same area da as in Fig. 5.3(b), but now on the upper block U/. The 
normal vector now is —n, and the stress vector is 


Tom (s; Y, z) = T (a, Yy, z) . (—n) = -T™ (ge Y, 2), (5.64) 


so that the force exerted by the atoms of the lower block £ on the atoms of 
the upper block UW across the same area da is dF“ = —T' da = —dF™). 
Which is in conformity with Newton’s third law of motion. 

In obtaining the last equality in Eq. (5.58) we have used the linearity 
property of the tensor as stipulated in (5.6). In this case T - (an) = a7 -n 
where a = —1. 

Like the inertia tensor, the stress tensor is a symmetric tensor, i.e. 


which can be proved using the equation of motion of the angular momen- 
tum. 


5.2.3. Diagonalization of a symmetric tensor 


A symmetric tensor can be always diagonalized. By this we mean the fol- 
lowing. Let T bea symmetric tensor. Let its components with respect to 
some axes (XYZ) be {Tj;} and that T,; = Tj;. By a suitable rotation of 
the axes (XYZ), one can arrive at another set of axes (Xo ¥oZo) such that 
Ti =O if i Aj. 
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We can express this formally in the form of the following equation: 


P=|'T. Te Th, + T=|[0 % oO . (5.66) 
Tea Tey Tez) xyz OO! AE seers 


The axes (Xp Yo Zo) are called the principal axes, and the diagonal com- 
ponents (T), 72,73) are called the principal moments of inertia in the case 


of Inertia Tensor, and the principal stresses in the case of Stress Tensor.‘ 


5.2.4. Gauss’s divergence theorem for a tensor field 


When we say tensor field, we mean a physical quantity represented by 
a tensor T(c, y,z) whose nine components Ty.(x,y, 2), Try(x,y, Z),---; 
T.2(x,y, z) are defined at every coordinate point (x,y,z). We assume that 
these nine components are all differentiable functions of the coordinates 
x,y,z. For such a tensor field, we define its divergence to be the formal dot 
product of the grad operator V with the tensor T(c, y, Z), it being assumed 
that V will appear on the left. 
Let us write the tensor T by the dyadic representation 


T=T@e,+TWe, +Te,, (5.67) 
as in Eq. (5.3). Then 
div T =v si =V- (Te, + Te, + Te.) 
def 


= (V-T®)e, +(V-T)e, + (V > T Jez. (5.68) 


Note that V-T@,V-T™,V-T® are the familiar scalar divergences of 
the vector fields T®), T™, T@) respectively, 


OT, OT, OT. 
Ox Oy Oz” 
OT, OT, OT. 
SY) a eg ee 
V-T ae Dy ae (5.69) 
Vv : Te) = OT yx af OT y- zm OT.» 


Ox Oy Oz” 


fSee [18, Sec. 10-4]. 
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and constitute three (scalar) components of the vector V- T along the X-, 
Y- and Z-axes, respectively. Combining (5.68) and (5.69), we get 


V-T= Lee ae e;= ee e;. (5.70) 


In the second equality, we have employed Einistein’s summation convention 
(introduced on p. 147). 

The divergence of a vector field is sometimes interpreted as “outflux per 
unit volume”. This association of divergence with outflux is due to Gauss’s 
divergence theorem briefly recalled in Sec. 11.2.2. Applying the divergence 
theorem, to the three vector fields T), T), T©) separately, we get the 
following three equivalence relations: 


If V- T(r r= ff T(r) - n(r) da, (5.71a) 
If V- Tv. r= ff T(r) - n(r) da, (5.71b) 
If V- T(r yr = ff T) (r) « n(x) da. (5.71c) 


Multiplying either side of Eqs. (5.7la)—(5.71c) with ez, e,,e. respectively, 
and adding, we get 


// V (Te, + Te, + Te,)dr 
Vv 
= ffir De, +TMe, +T@e,) - nda. (5.72) 


Identifying the dyadic within the parentheses as the tensor T, we obtain 
the divergence theorem for the tensor field: 


// — T(r)@r = If. T(r) nda. (5.73) 


We shall find this theorem to be crucial for constructing Maxwell’s stress 
tensor in the next chapter. 
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Since the stress tensor 7 is symmetric, we can write its divergence as 
follows: 
OT; OT ji 
—e, = e,—+ = ej Tyi,i- (5.74) 


Viel Ox; aa J Ox; 


In the last equality, we have adopted the convention ®; = se The jth 
Cartesian component of the divergence theorem (5.73) can therefore be 


written as follows: 
OT; 
ae ear = If. TP csisties (5.75) 


We shall find this practice useful while writing covariant equations in the 
context of the Special Theory of Relativity. 

Most authors prefer the expression (5.74) for the divergence. However, 
following our chain of discourse leading to the construction of the stress 
tensor, the expression (5.70) seems to be most natural. 


5.2.5. Volume force density in a stress tensor field 


Figure 5.4 shows an imaginary rectangular box abcdefgh of infinitesimal 
dimensions 6x, dy, 6z inside a medium under stress (which may be matter, 
or field). The centre P of this box is located at the coordinates (2, y, z). 
Let us assume that the stress in the medium is given by the tensor field 


-T @5 x/2,y,,2) 


Tet 5x/2,Y,,Z) 


(a) (b) 


Fig. 5.4. Stress force on a volume element. 
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aA 


T (x,y,z), whose components are differentiable functions of the coordinates. 
We shall find the total force on this box due to this stress. 

We have shown in Fig. 5.4(a) the outward normal vectors (ez, e,,e.) on 
the three faces of the box that are exposed to our view. The outward nor- 
mals on the other faces which are hidden from our view are (—e,, —ey, —ez). 
We shall identify each one of the six surfaces of the box by their outward 
normal vectors. 

Let us consider the opposite faces abcd and efgh, recognized by the 
normals (e,) and (—e,). The locations of their centres are (x + %,y, z) 

6 


and (a — $4,y,z), respectively. The stress forces on these two faces are 


7 (: - . Ys :)  (+en)by5z2 = T € + ae y, :) dydz 


IT bx 
= | rey. ee 
~ (2, y,2) + Ox =| dydz, 


. OT ba 
== [7 V(x, y, 2) — 3] dydz, 


T) OT 


OF 4, +060F_2 = 2 dxdyoz = OV, (5.76) 


where 6V = dxdydz is the volume of the infinitesimal box. In the same way, 
we find the forces on the other four faces of the block. Adding the stress 
forces on all the six surfaces, we get 


6V (5.77) 


IT®™ aT aTX 
Ox ¥ Oy aE Oz 


as the total stress force on the box. The volume force density f,, which 
gives the stress force acting per unit volume of the media under stress, is 
then given as 


nN nN 


5 OT? , OTH | ATO _ Alee-T) , Aey-T) , Aex-T) 
8s Ar Oy Oz Ox Oy Oz 
Oe gg ee ep 
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Fig. 5.5. Stress forces on a bulk volume. 


f,=V-T. (5.79) 


One may conclude that total stress force F, on a bulk volume V carved 
out inside a medium M, as shown in Fig. 5.5(a), is the volume integral of 
the force density f, carried out over the entire volume VY. We shall carefully 
analyze the forces inside the medium before jumping into this conclusion. 

Let us consider a two-dimensional view of nine tiny, imaginary neigh- 
bouring blocks lying inside the medium and forming a group G. We have 
marked the blocks as A, B, C, D, E, F, G, H, K, with A at the centre. In 
Fig. 5.4(b) we have shown the forces on the four sides of A as F), Fo, F3, Fa. 
The force F; comes from the neighbour B, and by Newton’s third law of 
motion, A applies an equal and opposite force —F, on B. Similarly, the 
forces F2,F3,F4 come from the neighbours C, D, E. And A applies equal 
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and opposite forces —F2, —F3,—F4 on them. It may then appear that these 
internal forces, when added together, get cancelled out and there should not 
be any stress force on the group G at all. 

A close examination will disprove this judgement. We have surrounded 
G by an imaginary boundary surface J. It is now seen that even though 
the action—reaction forces cancel out in the interior of the group G, they 
survive on the boundary surface ©. These surface forces Fy1, Fo2,..., Foie, 
when added together constitute the total force F, on the group G. 

In Fig. 5.5(c), we have divided the volume Y into an infinite number of 
infinitesimal blocks. The interior stress forces between adjoining blocks will 
cancel out. However, the forces on the boundary surface, some of which we 
have shown as Fy1, Fo2, Fos, Fo, will survive and add together to constitute 
the net stress force F, on the volume VY. 

We now get a clue of how to find the net stress force F, on the volume 
Y. In Fig. 5.5(d), we have shown the volume VY once again. At a certain 
point P on this surface we have pictured a tiny patch of area da, on which 
we have drawn a unit outward normal n. The stress force on this patch is 
df, =T™ da = T - nda. Integrate this force over the entire boundary to 
get F,. We shall perform this integration and convert the surface integral 
into volume integral by applying Gauss’s Divergence Theorem as derived 


in Eq. (5.73): 
F, =f. Fir)-nda= fff v-Fe) er (5.80) 


which reconfirms Eq. (5.79) 

The simplest example of stress field in matter is provided by a perfect 
fluid, which by definition, does not support shear stress. Since tensile stress 
is also ruled out in a fluid, the stress field inside a perfect fluid is left with 
only normal compressive stress, which is known more familiarly by the 


name pressure, to be written as p(x, y,z). It is then obvious that the stress 
field in a perfect fluid is the “pressure tensor”, having only three equal 
diagonal elements p(x, y, z): 


pt, U2) 0 
T(r) = W(t) = —p(z,y,2)1 = — 0 p(a,y, 2) 0 
0 0 P(x, Y, Z) 
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According to Eq. (5.79) the volume force density inside a pressure field is 


given as 
f(x, y, 2) =—-V.- [p(a, y, z)1] = —Vp(2, y, 2), (5.82) 
and the stress vector on a surface n as 


Pe 7 a= pu (5.83) 


Chapter 6 


Maxwell’s Stress Tensor 


6.1. Introduction 


‘Action at a distance’ (AAD) was an enigma to natural philosophers, from 
Rene Descartes* (1596-1650) to James Clerk Maxwell (1831-1879). We 
find an account of the evolution of physical concepts in [19]. According to 
Descartes, space was a plenum, a medium called aether, capable of trans- 
mitting force on material bodies. “It was to be regarded as the solitary 
tenant of the universe, save for that infinitesimal fraction of space which is 
occupied by ordinary matter.” 

Subsequent theoretical physicists and mathematicians, Robert Hooke 
(1635-1703), Isaac Newton (1642-1727), Reimann (1826-1866), W. Thom- 
son (1824-1907), Maxwell and others lent their support to this view. 
Implicit in their belief was the assumption that force cannot be transmit- 
ted except by actual pressure or impact. AAD was a taboo, as abhorrent 
as witchcraft: I wave my hand here and a fire is ignited there. In order to 
support their faith in aether they contrived every possible idea, any possible 
mechanical model, to make aether viable. 

According to Newton “All space is pervaded by an elastic medium or 
aether, which is capable of propagating vibrations in the same way as air 
propagates the vibrations of sound. This aether pervades the pores of all 
material bodies, and is the cause of their cohesion; its density varies from 
one body to another, being greatest in the interplanetary space.” 


“The Cartesian coordinate system is associated with his name 
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Maxwell inherited this legacy. We shall quote a few passages from his 
celebrated paper A Dynamical Theory of the Electromagnetic Field read to 
the Royal Society of London on December 8, 1864 [20]. 


“(1) In this way mathematical theories of statical electricity, of 
magnetism, of the mechanical action between conductors carrying 
currents, and of the induction if currents have been formed. In 
these theories the force acting between two bodies is treated with 
reference only to the condition of the bodies and their relative posi- 
tion, and without reference to the surrounding medium.” 


“(2) The mechanical difficulties, however, which are involved in 
the assumption of particles acting at a distance with forces which 
depend on their velocities are such as to prevent me from consid- 
ering this theory as an ultimate one, though it may have been, and 
may yet be useful to the coordination of phenomena.” 


“(3) The theory I propose may therefore be called a theory of the 
Electromagnetic Field, because it has to do with the space in the 
neighbourhood of the electric and magnetic bodies, and it may be 
called a Dynamical Theory, because it assumes that in that space 
there is matter in motion, by which the observed electromagnetic 
phenomena are produced.” 


“(4) The electromagnetic field is that part of space which contains 
and surrounds bodies in electric and magnetic conditions. ... It 
may contain any kind of matter, or we may render it empty of all 
gross matter, as in the case of Geissler’s Tubes and other so-called 
vacua. 


There is always, however, enough matter to receive and transmit 
the undulations of light and heat, and it is because of the trans- 
mission of these radiations is not greatly altered when transparent 
bodies of measurable densities are substituted for the so-called vac- 
uum, that we are obliged to admit that the undulations are those 
of aetherial substance, and not of the gross matter, the presence 
of which merely modifies in some way the motion of the aether. 


We have therefore some reason to believe, from the phenomena 
of light and heat, that there is an aetherial medium filling space 
and permeating bodies, capable of being set in motion and of trans- 
mitting that motion from one part to another, and communicating 
that motion to gross matter so as to heat it and affect it in various 
ways.” 
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One aspect of the mechanical model Maxwell built up to present a com- 
plete picture of the electromagnetic field was the proposition that space, 
i.e. aether, can sustain stress, and a force is transmitted from one body 
(electrified or magnetized) to another by means of stress, in the same way 
a force is transmitted from one end of a cable to the other by means of 
tensile stress, and from one part of a beam to another by means of shear 
stress. 

In his two-volume book A treatise on Electricity and Magnetism 
Maxwell presents a complete formulation of the stress in the field (read 
aether) by constructing the Stress Tensor for the Static Electric Field [21] 
and for the Static Magnetic Field, in terms of the field potentials. 

We have derived the stress tensors for electrostatic field, magnetostatic 
field and time varying electromagnetic field in terms of the electric field E, 
magnetic field B in a unified manner exploiting the useful identity given in 
Eq. (6.7). 

Einstein’s formulation of the Special Theory of Relativity saw the 
demise of the Luminiferous (i.e. light carrying) Aether. Light travels in 
empty space, electric and magnetic forces also propagate from one body to 
another (with the speed of light) in empty space. Is there then any place for 
Maxwell’s stress tensor? Is it only for historical reason that we are writing 
this long article? We shall attempt to provide the answer in four steps. 

First, it is indeed an amazing thing that the force acting on an isolated 
body A (which may consist of electric charges and currents), due to the pres- 
ence of charges and currents elsewhere, can be computed exactly by drawing 
a boundary surface S of our convenience surrounding A, as in Fig. 6.1(a), 
finding the “stress” all over this surface, and by integrating this stress. In 
other words, there is stress even in vacuum. The purpose of this chapter is 
to articulate how this stress is to be found out. Also it should be noted with 
interest that even empty space is not a true vacuum. When loaded with 
the electric and magnetic fields, space comes under stress. Empty space is 
always buzzing with emission and absorption of virtual particles, with the 
virtual photons mediating the interaction among electrified and magnetized 
objects. Aren’t these virtual photons the new avatar of the aether? 

Secondly, calculating the force on an isolated object A requires exact 
knowledge of the E or B field in which A is immersed. In recognizing these 
fields, one has to be very careful that these E, B fields do not contain any 
trace of the fields contributed by A itself. This is sometimes a challenging 
task. Consider for example the force acting on the surface of a conductor 
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Fig. 6.1. Electrified object in E field. 


carrying a surface charge density o, as in Fig. 6.1(b). The electric field just 
outside the surface is E = (0 /¢9)n where n is a unit normal to the surface. 
One may be tempted to conclude that the force per unit area of the surface 
is F’ = cE = (o7/eo)n, forgetting the fact that an infinitesimal area da on 
the surface contributes the same E field perpendicular to the surface as the 
rest of the surface, so that the true force is 


F = 5F = (0? /2e9)n = (eg E?/2)n. (6.1) 


The stress tensor approach, which uses the total field Eiota1, making no 
distinction between the test object and the source object, will give the right 
result without creating any confusion, as we shall show following Eq. (6.15). 

Thirdly, it is always advisable to arrive at the same answer through 
several alternative routes, if available, just to make sure that we have not 
made any mistakes. The stress tensor provides that valuable alternative 
route. 

And fourthly, Mazwell’s stress tensor, which we shall denote by the 
symbol T, is needed for understanding conservation and flow of momentum 
in the electromagnetic field, which we shall present in Sec. 6.5. When one 
goes deeper into the theory of relativity the same tensor appears as the most 
important component of the energy-momentum tensor required not only for 
presenting a four dimensional and unified view of the conservation of energy 
and momentum, but also for building up the source term in formulating 
Einstein’s field equation for the gravitational field, in his General Theory 
of Relativity. 
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6.2. Maxwell’s Stress Tensor for the Electrostatic Field 
6.2.1. Volume force density in terms of the field 


We shall now construct the stress tensor for the electrostatic field. 
We shall call this tensor Mazwell’s Stress Tensor and represent it by 


the symbol , where the superscript ‘°) implies electric field. 

Figure 6.2 shows a system of electric charges S placed in an electric field 
E(r). In Fig. 6.2(a), the system consists of discrete charges q1, q2,q3,--- 
placed at the radius vectors r;,r2,r3,.... In Fig. 6.2(b), the system is a 
continuous distribution characterized by a smooth charge density function 
p(r) confined within a volume. Our intention is to write the total electric 
force F on this system. 

The force on the discrete system shown in Fig. 6.2(a) is given as follows: 


F=)) gE@\(r;). (6.2) 


Here the sum is over all the charges in the system, and E‘*)(r;) is the 
external electric field at the radius vector r; caused by the presence of all 
other charges lying outside the system S. 

For the case of continuous distribution, shown in Fig. 6.2(b), the indi- 
vidual charges become infinitesimal elementary charges, i.e. g; + p(r)d?r, 
and the sum becomes the integral 


Re / | ie p(r)E (r) dr, (6.3) 


S ds (ext) S- a 
Vee mera ae 3 e Be ye 
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Fig. 6.2. Forces on charges in an electric field. 
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What about the force from the charges inside the system S. They are 
internal forces, and cancel due to Newton’s third law of motion. 

Let Ef") (r,) be the “internal” field caused at rj; by a member particle 
i lying within the system S. Then Fj; = q; EO" (r r;) is the force that the 
member particle 7 exerts on the member particle 7. By Newton’s third law 
of motion, q¢j;E°"" (r;) + qiES™ (rj) = 0. Adding together over all pairs 
for the discrete distribution, and integrating over the entire distribution for 
the continuous distribution we get 


For discrete: B qj B E} pt) -> GB (x 


For continuous: / | [0 r) EO) (p 


In the first equation, the sum symbol a means that while summing 
over i, the term i = 7 (corresponding to the “self field” of the member j) 
is to be avoided. The “internal field” E“"")(r;) is the field at the location 
of the member 7 caused by “all other members” in the system S. In the 
second equation E“")(r) is the “internal field” at the radius vector r, as 
sensed by a tiny volume element d°r at this point. 

We shall add the null contribution shown in the second line of Eq. (6.4) 
to the right-hand side of Eq. (6.3) and write 


= / | I p(r)E (r) dr. (6.5) 


Here E(r) is the actual field at the point r, being the sum of two contri- 
butions, from the (i) external sources, and (ii) the internal sources of the 
system S. 

The purpose of adding the null integral of Eq. (6.4b) to Eq. (6.3) is that 
when we write the force density f, the internal forces need to be added. 
That is, 


(6.4) 


f(r) = p(x)E (x) (6.6) 


is the force on unit volume of the charge distribution at r, in which E (r) 
is necessarily the total field at this location, caused by both external and 
internal sources. Now we manipulate the right-hand side of Eq. (6.6) so as 


to convert pE > V- oe as suggested in Eq. (5.79). This new tensor field 
=(e) 
r 


(r) would represent “stress” in the electrostatic field. 
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Construction of the stress tensor for electrostatic field, magnetostatic 
field and time varying electromagnetic field will be facilitated by the fol- 
lowing identity [22]: 


V- [AA = 544] =(V-A)A—Ax (Vx A). (6.7) 


Before establishing the above identity we shall need a standard formula 
(see, for example, vector identities compiled in Griffiths, 4th edn). 


V(A-B)=Ax (Vx B)+Bx (Vx A) 
+(A-V)B+(B-V)A. (6.8) 
By setting B = A in the above formula, we get 


Vv (54°) =Ax(VXA)+(A-V)A. (6.9) 


We shall now prove the identity (6.7). 


Proof. 


) 
VV: (AA) = (eo -) : (e,e;A;A;), 


=(V-A)A+(A-V)A, (a) 
(a) (Gene) 
De 


one ile 
~ 2 On, =v (547) 


=Ax(V x A)+(A-V)A, by (6.9). (b) 
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The identity (6.7) follows when we subtract line (b) from line (a). 


Note that we have used Einstein’s summation convention introduced on 
. 9 _ 3 _ a3 3 
p. 149. That 1S, ese = Al ese; eje; Aj A; = i pas eje;AjA;, 
etc. 


174 Special Relativity, Tensors, and Energy Tensor 


The stress tensor for the electrostatic field follows when we set E for A 
n (6.7), and use the field equations: V -E = p/e9; V x E= 0: 


= (0) 
f°) = pE=V-T , 
a (@) (6.10) 


where F? = eo[BE — 171]. (b) 


It will be a simple exercise to write the Cartesian components of this 
tensor: 


a(e) 


(ec) (ec) (ec) 


T (T * ey T ss ey T " e,) 
(2 — E? — EB?) Ege, BLE 
=€ EyEx 3(E2 — BE? — E?) EyEz 
poe E,Ey (EB? — EB? — E?) 


6.2.2. Example 1: Stress vector on a plane as a function 
of the angle of inclination 


The stress tensor (6.10) will remain abstract and obscure unless the reader 
works out a few examples. We shall provide two examples of which the 
first one is depicted in Fig. 6.3. A uniform electric field E = Fe, exists in 
a certain region of space. The stress tensor is then given by the following 
expression: 


E? 0 0 
z) = FE’ (eres — yey — e,e,) — = 0 —E? 0 : (6.12) 
0 —E? 


Imagine a plane running parallel to the Z-axis, but inclined to the X-axis 
by an angle 6 (Fig. 6.3(a)). The normal vector is then given as 
sin 6 
n=e,sin@+e,cos@= | cosé |. (6.13) 
0 
The stress vector 7”) on this plane is then 
sin 6 
= (en sin # — e, cos@) = EB —cos@ |. (6.14) 
0 


zi) FO) 
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Fig. 6.3. Stress vector on an inclined plane placed in a uniform electric field. 


Let us consider some special cases: 


T) = — Bey (by setting 6 = 7/2), (6.15a) 
TY = -= E’e, (by setting @ = 0), (6.15b) 
To = -— Be, (same as ie -e,), (6.15c) 

° E0 1 
745°) TE ales —e,). (6.15d) 


Equations (6.15a)—(6.15c) give the stress vectors on the planes identified 
by the normal vectors e,,e,,e, and Eq. (6.15d) gives the stress vector on 
a plane making an angle of 45° with X-axis. We have illustrated these 
points in Figs. 6.3(b) and 6.3(c). We have shown the stress vectors with 
thick arrows, and labelled them with the bold Greek letter T. We draw 
the following conclusion. 


Conclusions: 


(a) If the field is perpendicular to the plane, the stress vector is normal 
and outward (tensile stress), and equal to 2E?. 

(b) If the field is tangential to the plane, the stress vector is normal and 
inward (compressive stress), and equal to 2 E?. 

(c) If the field makes angle 45° to the plane, the stress vector is tangential 
(shear stress), and equal to 2B. 


Case (a) applies to a conductor in an electric field E. The field is per- 
pendicular to the surface. The surface force density is the same as the stress 
vector. We get back the same answer as in Eq. (6.1) using the stress tensor, 
without laboring to find out what is the “external field”. 
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6.2.3. Example 2: Force transmitted between two charged 
particles across a spherical boundary 


We shall obtain the familiar Coulomb force between two charged particles 
using Maxwell’s stress tensor. 

We shall use spherical coordinate system. The reader must have used 
the spherical coordinate system to construct vectors in situations of spher- 
ical symmetry, as in the case of central forces in mechanics. This example, 
and two more the next sections, will give an opportunity to deploy the same 
coordinate system to construct a tensor, the stress tensor to be specific, to 
get at the answer with less expenditure of time. 

We shall first obtain an expression for the E field at any arbitrary point 
P (r,0,¢) located on the spherical surface ©. The point P is at the dis- 
placement vector 7 from A and r from O (Fig. 6.4(a)). In order to avoid 


repeated appearance of the constant ia? we shall set E = -4-€. Note 
that 
n=r—-a=r-aez, (6.16a) 
so that 1? =r? +a? — 2ra cos8, (6.16b) 
and e, =cosée, — sin deg. (6.16c) 
Then 
Qr an 
Qe, q(r _ aez) 
= — + Ss: 6.17b 
r? (r2 + a? — 2ra cos 6)3/2 ( ) 
Therefore, 
E=€E,e, + Epeg, (6.18a) 
Q q(r — acos@) 
hee 2332-2 gp 6.18b 
Bene r2 “ (r2 + a? — 2ra cos 6)3/2 ( ) 
qasin@ 
& = ——_—_— |. 6.18 
° (7? + a? — 2ra cos0)3/? oe) 
From Eq. (6.10), the stress tensor is 
a(e) 1 2> 1 1 > 1 ~(e) 
= €9(EE — -E*1) = —--€*1)= 
=a oo ee Oo eee 
; (6.19) 
where iad = €E— xe 1 


which we may refer to as the “reduced stress tensor”. 
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Fig. 6.4. Electric stress vector on a spherical surface. 


Since we have invoked the spherical coordinate system to write the 
expression for the € field, the components of the tensor ; a will have 
to be written in this coordinate system. Since only r and @ components of 
E are non-zero, the non-zero components of this tensor are 7;,, Tro, Tor, 700; 
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as seen from (6.19). Therefore €? = €? + €7, and we write this tensor as 


Ter Tro 0 
Gq) =| Tor Too O}, where 
0 0 O 
r 9 gr ’ 


Tro = Tor = E, EG, 
1 1 
Tos = EG — ul = 5 (6 == he 


The first column in the square matrix on the left represents the stress 
vector J,. on the spherical surface © (corresponding to n = e,;, analogous 
to the first column in Eq. (5.62). Using the expressions for €,.,€ given in 
(6.18) we shall work out the components of T,. explicitly as follows: 


T, = er T rr = eg Tor, 


_17Q? | @l(r —acosé)? — (asin @)?] 
~ 2 | rt (r? + a? — 2ra cos 6)3 


2Qq(r — cos 0) | 


r2(r2 + a? — 2ra cos 6)3/2 
Qqasiné 


p= €,€5 = ——>——___—_—_ 
vy 0 P2(r2 + a2 — 2ra cos 0)3/2 


qasin 0(r — acos@) 


Se 6.21 
(r2 + a? — 2ra cos6)3 ee) 


The first component 7, is the normal stress on the surface © and the 
second one 79, the tangential (or, the shear) stress. 

In order to illustrate the above equations, and to see how the electric 
field vector E and the Maxwell’s stress vector 7, vary on the surface of the 
imaginary sphere 4), we shall make a numerical example, setting Q = 2,q = 
—l,a=3,r=1 in Eqs. (6.18) and (6.21). The expressions we now get are 
functions of the polar angle 6 only. We have plotted 7,.,., Jo, in Fig. 6.4(b), 
using Maxima. 

In order to show how the field vector € and the stress vector J, vary on 
the surface of the sphere © we have prepared Table 3.1 after evaluating the 
corresponding quantities in the columns 1-9, using Maxima. The angles 
on, appearing in columns 5 and 9 have been explained in Fig. 6.4(c). 
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Table 3.1. & and 7; vectors on the surface of the sphere. 


The first one is the angle between the normal e, to the surface © and the 
electric field E at the surface, and the second one is the angle between e, 
and the stress vector J, on the surface. 


/ &6 
Tor 
T =< Te + Tes ae ae 


We have drawn the field vectors E and the stress vectors 7, on the sphere 
X in Figs. 6.4(d) and 6.4(e) (using two different scales for the two sets of 
vectors). 


(6.22) 


All this tedious work will have been fruitful if we could show that the 
surface force density, when integrated over the entire surface , will give 
us back the familiar Coulomb force between the two charges. The surface 
force density is the same as the stress vector on this surface. We shall work 
with the “reduced” surface force density, same as T,.. 

The Coulomb force of attraction (if Q,q are of opposite signs) or repul- 
sion (if they are of the same sign) will be along the line OA joining the two 
charges. Since this line coincides with the Z-axis, we shall integrate the Z 
component of T;, which we shall denote as fz. We go back to Eqs. (6.16) 
and (6.21) to compute this force, and get the following results after some 
simplification: 


fe = e7° Tx 
= (cos Oe, — sin Beg) - (eT + eg Tor) (6.23a) 
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= cos 67, — sin 0T9,. (6.23b) 
= f.(Q’) + f.(Qa) + f.(@), where (6.23c) 
iQ )= ae cos 8, (6.234) 
~ _ Qaq|r cos 6 — a] 
f:(Qq) = Pe a! — ara cose (6.23e) 
fg) = 1 @[(r? +47) cos0 — 2ra] (6.238) 


2 (r? + a? — 2ra cos) 


The expressions in lines (6.23d) and (6.23f), involving Q? and q?, are 
“self-terms” , whereas the expression in (6.23e) involving Qq is the “interac- 
tion term”. The reader should complete the steps leading from (6.23b) to 
these equations. We shall soon show that the self-terms will vanish upon 
integration, leaving the integrated stress force entirely a function of Qq. 

The “reduced” force transmitted across the surface 1, and hence acting 
on the charge Q, is the surface integral of ce Let us denote this integral as 
F. An area element on © is da = r? sind do. Therefore, 


a, fer’ sin@ do dd 
yy 
= 2nr? "Ze in 6 dO 6.24 
Tr | f,sin (6.24a) 
= 2nr?[Z(Q*) + T(Qq) + L(a°)), (6.24b) 
where Z(Q =f f.(Q?) sin 6 dO = 0, (6.24c) 
L(Qq) = [ f.(Qq) sin 6 dd = a (6.24d) 
= f.(q’) sin 0 dO = 0. (6.24e) 
0 
e ArQ 
Hence, F = — —. (6.24f) 


The integral given in (6.24c) is easy to evaluate. The other integrals 
have been worked out in the Sec. B.1. They can be worked out more easily 
using Maxima with a computer. 
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To get the true force we go back to (6.19), multiply F with the factor 
Tn and get the force Fg acting on the charge Q: 
i. Qq 


= oe 7 6.25 
1672e€9 : neues” ( ) 


FQ 


This force is the familiar Coulomb force on the charge @ located at the 
origin, exerted on it by another charge q located at a distance a on the 
positive Z-axis. It is repulsive, i.e. towards the negative Z-axis, if Qq is 
positive, and attractive, i.e. towards the positive Z-axis, if Qq is negative. 


6.3. Maxwell’s Stress Tensor for the Magnetostatic Field 


This section is the magnetostatic analogue of the electrostatic stress ten- 
sor presented in Sec. 6.2.3. The steps are parallel, so that we shall avoid 
detailed explanation. 


6.3.1. Volume force density in terms of the field 


We shall construct Maxwell’s stress tensor for the magnetostatic field, rep- 
resent it by the symbol ile The volume force density in a magnetic field 


is f™ = J x B. Therefore, we need to construct the tensor a under 
the specification 
VF =1™ <I xB. (6.26) 


This is now an easy task, thanks to the identity (6.7) we had established 
in Sec. 6.2. We set B for A in that equation, and use the field equations: 
V-B=0; V x B= pW0J, leading to: 


PY=TxkB=V-7". w 
(6.27) 


where FT = [BB — $B" . (b) 


1 
Lo 


Note the similarity between the stress tensor (ie written above and the 


stress tensor ‘a written in Eq. (6.10). The former converts into the latter 
if we replace E with B and ¢€9 with _ In the same way the matrix form 


given in Eq. (6.11) converts to the matrix form of er , Consequently, the 
stress vector changes from normal outward, to tangential, to normal inward, 
as the angle between the plane and the direction of the B field changes 
from 90° to 45° to 0°, as shown in (6.15) and illustrated in Fig. 6.3, and 
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Fig. 6.5. Magnetic stress vector on a spherical surface. 


the “Conclusion” written on p. 175 carries over to the case of a magnetic 
field without any change. Each point in the conclusion is well illustrated in 
Fig. 6.5 (see next section) if the reader compares the direction of the field 
vector B in Fig. 6.5(d) with the direction of stress vector J; in Fig. 6.5(e). 
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6.3.2. Example 3: Force transmitted between two magnetic 
dipoles across a spherical boundary 


The smallest denomination of the source of a magnetic field is a magnetic 
dipole, consisting of a tiny current loop. We shall therefore think of the 
force between two magnetic dipoles. We have placed these dipoles along the 
Z-axis, oriented them in the positive direction of this axis. Figure 6.5(a) 
shows the geometry of this configuration. The dipoles are shown by tiny 
spherical blobs with an arrow pointing in the direction of this vector. As in 
the electrostatic example, we shall illustrate Maxwell’s stress tensor ais 
by finding the stress vector on the surface of an imaginary sphere » of 
radius 7 surrounding the point magnetic dipole M which is placed at a 
distance a from the other point magnetic dipole m such that r < a, and 
then integrate this stress vector over the spherical surface to obtain the 
force Fy, on M exerted by m. 

We shall first obtain the B field at any arbitrary point P (r, 0, ¢) located 
on the spherical surface /, at the displacement vector 7 from A and r from 
O. In order to avoid repeated appearance of the constant 4°, we shall set 
B= 2B, and use Eq. (6.16). 

Let BO) (r,0,6), and B(r,0,¢) be the fields? produced by the 
dipoles M and m respectively, at any coordinate point (r,6,¢). Adding 
them we get the total field B(r, 0, ¢): 


B(r,0, 4) = BO (r, 0,4) + B™ (r, 8, 4), (6.28a) 
v7 M 7 a M 2 
B) (r, 0,4) — wee = Be, + BoMes, (6.28b) 
2M M si 
where BO“) = a BY = — (6.28c) 
r r 
3 : _ m 
B' (r, 6, ¢) = cs) ees = B™e, + BS Jeg, (6.28d) 
n 
2 2 2 6 — ) Q 
wee Bo) = See es ee el (6.28e) 
1) 
Bi”) _ m(r? — 2a? + ar cos) sin (6.288) 


7 


bSee [14, Eq. (3.89)]. 
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For future convenience, we write 


B= B,e,+ Boeo, where 
M m 
B,. —= (=) Ose (=) b, 


Bo = (=) y+ (=) 6. (6.29) 


From (6.28): a@=2cos0, 8 = 2(r? +.a”)cos@ — (3 + cos? @)ar, 


y=sinO, 6 = (r? — 2a? + arcos6)sin8. 


From Eq. (6.27), the stress tensor is 


ae te = 5°) me (ss _ 561) 


Lo 2 16x? 
— Ho z(m) 
=r, (6.30a) 
ie ee 
where 7°” = BB - 5871, (6.30b) 


which we may refer to as the “reduced stress tensor”. The non-zero com- 
ponents of this tensor needed by us are 


1 1 
Ter = Br — 5B? = 5 (Bp — Bg); Tro = Tor = BrBo. (6.31) 


In order to illustrate the above equations, and to see how the magnetic 
field vector B and the Maxwell’s stress vector oi look like on the sur- 
face of the imaginary sphere surrounding the charge Q, we shall make a 
numerical example, setting M = 2,m = l,a = 3,r 1 in Eqs. (6.29) 
and (6.31). For this purpose, we have prepared Table 3.2, after evaluating 


the corresponding quantities in the columns 1-9 using Maxima. The angles 
és, dr appearing in this table have been explained in Fig. 6.5(c). See also 
Eq. (6.21). 

We have plotted 7;,, 7g, as functions of the polar angle @ in Figs. 6.5(b), 
using Maxima, and have drawn the vectors B and 7,. on the sphere © in 
Figs. 6.5(d) and 6.5(e) (using two different scales for the two sets of vectors). 

All this tedious work will have been fruitful if we could show that the 
surface force density, when integrated over the entire surface %, will yield 
the same force between the two dipoles that we can calculate using the 
standard formulas of magnetostatics. Let us then first apply the “standard 
formula” . 
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Table 3.2. B and 7; vectors on the surface of the sphere. 


1 2 3 4 5 6 7 8 9 

6 B, Bo B és Trr Tor si or 
0° 4.25 0 4,25 0° 9.03 0 9.03 0° 
30° 3.58 0.86 3.69 13.5° 6.05 3.09 6.80 26.9° 
60° 2.00 1.64 2.59 39.3° 0.66 3.28 3.34 78.5° 
90° —0.03 1.96 1.96 89.4° 1.91 0.06 1.91 Lee 


The force F,, on m is given by the formula F = (m- V)B, in which B 
is the field created by M. The m vector is in the Z-direction. Therefore, 
m:-V= me, which means that we can treat the (x,y) coordinates as 
constant and equal to zero. Therefore, 


OB 
F,, Mm >— ? 
Oz xr=y=0,z=a 
M 2 2 
where, B(0,0,z) = o E : 7 Je 
. 7 (6.32) 
OB 3u0M 1 
Oz ae ae 27 at 
Hence, F,, = ene 
2rat 
By Newton’s third law of motion, 
30M 
i, =<6-= ae. (6.33) 


Now we shall calculate the same force using the stress tensor. The sur- 
face force density is the same as the stress vector on this surface. We shall 
work with the “reduced” surface force density, same as T,.. 

The force of attraction between the dipoles will be along the line OA 
joining them, which lies on the Z-axis. Therefore, we need the Z component 
of the surface force density fA 


i, =e,:7T,. = (cos6e, — sin deg) - (e; Tr, + e97o,) = cos OT, — sin OTo,. 
(6.34) 


We shall break up this force density into three components: (1) f,(M?) 
representing self-term for M, (2) f.(d/m) representing interaction term 
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between M and m, (3) f.(m?) representing self-term for m. From 
Eqs. (6.29), (6.31) and (6.34): 


= page @ 2 oe ap ee 
f.(M’) = [(a? — 77) cosé — 2a7sin 6] ae (6.35a) 
PGi) = (ab eos = CE HOGA ae (6.35b) 

- ae 
fz(m?) = [(8? — 6°) cos — 286 sin 6] 310" (6.35c) 


The “reduced” force F transmitted across the surface D, and hence 
acting on the dipole M, is the surface integral of fs which is the sum 
of the integrals of f.(M?), f.(Mm), and f.(m?). Each integral is difficult 
to evaluate, because a, 3,y,6 are complicated functions of r,a,@. We have 
evaluated these integrals using Maxima. See Appendix B. The result is as 
follows: 


F= /I fer? sin 6 dé dd = 2ar? f f.sin 6 dé 
>»; 0 
= Inr?[Z(M?) + T(Mm) + Z(m?)|, 


where Z(M?) = | f.(M) sin 6 dO = 0, 
0 


T 12Mm (6.36) 
z(Mm) = [ f.(Mm) sin 0 dé = gare? 
L(m?) = f.(m?) sin 6 dé = 0. 
) 
Hence, F= — 
a 


Because of the relation (6.30) the true force F,, acting on the dipole 
M is 55> times the force F’. Hence 


= 3u0oMm 


Py = Zz 6.37 
Qrat . ( ) 


We have thus verified that the stress tensor has given us the same force 
that we obtained in Eq. (6.33) using standard formulas of magnetostatics. 

We have worked out three examples to bring out the meaning of 
Maxwell’s stress tensor for electric and magnetic fields. The reader may 
wonder why we should go through such a tortuous road to get answers 
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that can be easily obtained using simpler formulas of electrostatics and 
magnetostatics? Isn’t it like demolishing a mud wall with a cannon? 
Every cannon needs a mud wall to ensure its trust-worthiness before 
deployment in a true situation. Maxwell’s stress tensor is destined to play 
a bigger role, in constructing the conservation equation for field momentum, 
and later under the watchful eye of Special Relativity, in building up the 
covariant expression for conservation of energy and momentum. The three 
examples we have worked out were intended to be an intellectual exercise to 


instill confidence in the mathematical expressions of | il and eo before 
crowning them for their majestic role. 

Our next example is not a mud wall. It shows how Maxwell’s stress 
tensor can solve a difficult problem directly. 


6.4. Example 4: The Force Between Two Hemispheres 
of a Charged Sphere 


Consider a uniformly charged sphere of radius R, and carrying a total 
charge Q. What is the (repulsive) force that the lower hemisphere exerts 
on the upper hemisphere. 

Finding the force by a naive application of Coulomb’s law can be 
difficult. 

The solution of this problem can be found in Griffiths.° However, 
Griffiths employs Cartesian system. We shall use the spherical coordinate 
system to obtain the result compactly. We have illustrated the geometry 
in Fig. 6.6. 

We divide the boundary surface into two parts: (1) the upper surface 
Stop, on which the normal vector is e,, (2) the lower surface Spottom on 
which the normal vector is —e, = eg. The E-field is radial on both. We 


7 


shall find the stress vectors Tt“), 7°), and their normal components 
on both surfaces, and by integrating, shall get the answer. Let us first 
get the stress tensors on the top and the bottom surfaces. The (reduced) 


electric fields are as follows: 


et) — Gon g(r) — BT, (6.38) 


°See [14, p. 368]. 


188 Special Relativity, Tensors, and Energy Tensor 


Shottom 


Fig. 6.6. Charged hemisphere. 


Using Eq. (6.19), the (reduced) stress tensor takes the following forms: 


=(t) @? a ~(b)  @Q?r? 2 
To =sm (9 -1 0], T°=Se]o0 -1 0 |. (6.39) 
oo) =! 0 0 =-1 


Note that the normal to the top surface is n = e,, and the normal to the 
bottom surface is n = —e, = eg. Since the net force is in the z-direction, 
we shall consider only the (z,1r)-component of the stress vector on the top 
surface, and the (z,@)-component on the bottom surface 


rit) =e, - qq -e, = (cos de, — sin Beg) - ql -e,; 
2 
= cos 67, = _ cos 0, (6.40) 
~(b =(b 2b 272 
oy er eee a "6g ai 


a) 


It is seen from the last equation that the stress vector is pointing into 
the volume above the surface, though the field € is parallel to the surface. 
This may appear strange on first sight, but conforms to Eq. (6.15), and the 
conclusions following them. 
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Integrating the stress vectors given in Eq. (6.40) over the respective 
surfaces, we get the force on the upper hemisphere. 


2 2 
(t) _ E0Q 2: _ Q°a 
F RA [ f,cosar sin 6 dé do DR?” 


ib) — £00" ie pepe Se 


(6.41) 


2R6 2Re * 4 AR?° 


Adding the above two forces and multiplying with wre (see Eq. (6.19)), 
we get the total force: 


i 320? 
= ————“¢,. 6.42 
Ate 16R2~ ( ) 
6.5. Maxwell’s Stress Tensor for the Electromagnetic Field 
and Momentum Conservation 


We had introduced Maxwell’s stress tensor for static electric and static mag- 
netic fields, with suitable applications, in Secs. 6.2 and 6.3. These applica- 
tions demonstrated that the force acting on static distributions of electric 
charges and currents lying within a bounded volume Y is equal to the stress 
vector integrated over the surface S bounding this volume. The attribute 
“static” implied that the objects considered in our discussion, e.g. isolated 
charges and isolated current carrying loops, were fixed with a kind of “glue” 
making them immobile in spite of the electric and magnetic forces acting 
on them. We shall now remove that glue and see what role can now be 
played by the same stress tensors. 

At this point, we shall make a subtle distinction between force and 
stress. Force acts on material objects which may be discrete charged par- 
ticles or a localized continuous material media, e.g. a plasma. The stress 
considered here acts on the field, which is a kind of ethereal medium, as 
conceived by Maxwell and his contemporary physicists. In the absence of 
any glue holding them, the charges (e.g. electrons, nuclei) and currents 
(e.g. current loops) will be free to move and gain momentum. However, 
the momentum need not be confined to material objects. It can be shared 
by the field as well. Therefore, we shall make the following conjecture. 


Conjecture 6.1. There exists a Maxwell’s stress tensor (aie for the elec- 
tromagnetic field, and it is given as 
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such that 


s U/L» #r) + f[fre) ~ I T°" n(r)da, (6.44) 


where g and P are, respectively, the field momentum density and the mate- 
rial momentum density, the latter being governed by Newton—Minkowski-— 
Lorentz-force equation 


“ = pE+JxB. (6.45) 


Moral: It is seen from (6.44) that the force transmitted by the Maxwell 
stress tensor Tests across a closed surface S contributes to the total momen- 
tum inside the volume V (bounded by the same surface), and has two parts, 
namely, the mechanical part and the field part. 

In Appendix A.3, we have shown all the 3 x 3 components of ‘i 
explicitly. 

The right-hand side of Eq. (6.44) gives the stress transmitted across the 
boundary S. The right-hand side of Eq. (6.45) gives the density of Lorentz 
force acting on all charged matter lying within the volume VY. We shall 
convert the surface integral on the right-hand side of (6.44) into a volume 
integral, using Gauss’s theorem (see Sec. 5.2.4) so that each term in this 
equation is a volume integral, and then remove the integral sign reducing 
the same equation to an equality among three density functions: 


Og OP =(em) 
ay + aE = -T , (6.46a) 
oe 78 4 B+ IXBHV-PO 40 P™, (6.46) 


We shall now show that the above conjecture is right, that starting 
from Maxwell’s equations we are able to find an expression for the field 
momentum density such that the momentum conservation of matter and 
field together falls into the scheme suggested in Eq. (6.46). Our task is 
made simple by the identity (6.7) we had established in Sec. 6.2. We shall 
do the work in two stages: (1) set E for A in (6.7), and use Maxwell’s 


equations: V-E = p/e9; Vx E= — 8 (2) set B for A and use Maxwell’s 
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equations: V-B=0; V x B= po(J + eo). Hence, 
zike oe 
VT =V-é [ep - 50i| =<e9|((V:- E)E—-Ex (V x E) 


OB 
= pE+ eoE x — 


Or? (6.47a) 
=>(m) 1 1 ios 1 
V:-T =V-—|BB- =B71| = —|[(V-B)B- Bx (V xB) 
Ho 2 Ho 
OE OE 
=-Bx (J+ co57) =IxBteos xB, (6.47b) 
a 2 (coE x B)+(pE+J xB). (6.47c) 


Equation (6.47c) is obtained by adding Eqs. (6.47a) and (6.47b), and 


using definition of ‘aia as given in (6.43). It confirms validity of our 
conjecture and identifies the field momentum density as 


g = €0(E x B). (6.48) 


We shall like to recast Eq. (6.46a) into the general format of the con- 
servation equation 


2 (volume density) + V - (flux density) = 0. (6.49) 


In this case, the momentum flux density eo is to be identified as 


(em) _ += (em) 


=-J . (6.50) 
Equation (6.46a) now reads like a true momentum conservation equation: 


O on 

es ee oo aa (6.51) 
It may be easier to comprehend the meaning of the above conservation 

equation by writing its three Cartesian components. For example, the z- 

component of the above equation will be 


OP,  OGx = 
a ta FV a2 =0, (6.52a) 
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= (em) 


where ©, =@-e, =-T (6.52b) 


= —€ ee E} — E?) + ey ByE, + e,E, Br] 


1 
e 25 (Be - Bi B2) + eyByBs + e.B.Bs|. (6.52c) 


The first two terms in Eq. (6.52a) give the rate of increase of the 
x-component of total momentum (consisting of field momentum and mate- 
rial momentum) per unit volume, the third term gives the rate of outflux 
of the z-component of the field momentum per unit volume. Conservation 
of momentum implies that the sum of the two must be zero. 

Before leaving this topic let us recall the expressions for the field energy 
density w and the field energy flux density S (i.e. the Poynting’s vector, 
Sec. 12.3) 


w = —|E*?+c?B?] Field Energy Density, (6.53a) 


eo 

2 
S =eoc[E x cB] Field Energy Flux Density. (6.53b) 

It is immediately noticed that 

S= 9. (6.54) 


When the electromagnetic field is a radiation field, E = cB and ExcB = 
E*n where n is the direction of the Poynting’s vector, giving the direction 
of the flow of radiation energy. For such radiation fields, 

w=eob?, S=cun, g= =n, w=cg. (6.55) 
The last equality is a reminder of the relation E = cp between the energy 
FE and the momentum p of a photon. 

We are still not too clear about the true meaning of the momentum flux 
density ®. To get familiarity with it let us consider a plane electromagnetic 
wave propagating in the x-direction, polarized in the y-direction. For such 
a field E = He,, cB = Ee,. It is a simple exercise to evaluate ® by setting 
bE, = 0, Ey = BE, E, = 0; cB, = 0, cBy = 0, cB, = E in the expression 
for ®, in Eq. (6.52c) and similar expressions for ®,, ®, and obtain 


® — @,e, + ®,e, + ®,e, = (eo Bez )ex = cgex = gc. (6.56) 


Here c = ce, represents the “velocity” of light, being the speed c multiplied 
with a unit vector in the direction of propagation. If we now consider a 
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plane perpendicular to the X-axis, so that n = e,, then the outflux of field 
momentum per unit area across the plane will be @-n=6. e, = cg. 
Generalization of Eq. (6.56) is obvious. If there is a source of radiation 
at the origin (say, an antenna, or an accelerating charged particle), then far 
away from the origin, the momentum flux density tensor ® has the form 


& = Cgerer = gce, = gc, (6.57) 
where e, is the unit vector in the radial direction, also identified with the 
direction of propagation of the electromagnetic wave. The tensor ® gives 
the measure of how much momentum is crossing a spherical surface per 
unit area per unit time. The momentum density is g = Le,, and it is 
propagating in the radial direction with velocity c = ce,. 
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Part ITI 
Physics in Four Dimensions 
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Chapter 7 


Space—Time and Its 
Inhabitants 


7.1. World Line in Space—Time 


It was noticed from the Lorentz transformation formulas derived and writ- 
ten in Secs. 3.1 and 3.2 that the space coordinates and time coordinates 
of any event @ in the frame $” is a linear combination of the space and 
time coordinates in S, and vice versa. Therefore, in a certain sense, the 
borderline between the space coordinates on the one hand and the time 
coordinate on the other, looks blurred in Relativity. We fancy therefore a 
four-dimensional world of events where the time coordinate takes an equal 
status along with the three space coordinates. This composite world, inte- 
grating time with space, is called Space—Time.* 

Space-time needs four axes, namely, the time axis cT’, and the space 
axes X,Y, Z, and four coordinates (ct, x,y,z). It will be convenient to label 
the coordinate axes as X“, and write the four coordinates of an event as 


(x), with « = 0 for the time axis/coordinate ct, and uw = 1,2,3 for the 
2 


space axes/coordinates x, y, z, respectively. That is, (x° = ct, 21 = 2, x 
y, x? = z). We shall follow this convention in this book. Also, we shall use 
Greek indices, e.g. u,v,a, 2 to mean all the four coordinates, and Roman 
indices, e.g. 2,7, to mean only the three space coordinates. For example, 
we may write (#") = (x°; 2") = (ct, 2, y, 2). 

Note that we are using the coordinate index p as a superscript, i.e. as 
a contravariant index. The reader will understand the reason in Sec. 7.8, 


4A detailed exposition of the original paper of Minkowski can be found in [41]. 
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Fig. 7.1. Events A, B, C on space-time diagram: (a) Two space axes X,Y shown; 
(b) single space axis X shown. 


where we shall make a distinction between a contravariant 4-vector and a 
covariant 4-vector. 

In Fig. 7.1, we have presented a view of the space-time, which 
we shall refer to as a space-time diagram (or ST diagram), and dis- 
played three events along with their coordinates: A= (x9, 21,27, 23), 
B=(a2?, x}, 22,23), C = (2°, x1, 22, 03). In Fig. 7.1(a), we have suppressed 
the Z-axis, so that we can show events on paper, and marked the coordi- 
nates on the respective axes. In Fig. 7.1(b), we have made the ST diagram 
simpler by suppressing both Y- and Z-axes, exposing only the X-axis and 
the time axis. 

The above mental construct of the four-dimensional world will not 
diminish the role of the familiar world of pure space dimensions X,Y, Z. It 
is in this space that we see objects like satellites, planets, locomotives. We 
shall call this space the physical space. 

In Fig. 7.2(a), we have presented another view of space-time for describ- 
ing the motion of a particle which is confined to move only on the X Y-plane. 
C is its physical trajectory, and P is one point on it. 

On the other hand, the trajectory of the particle in space-time, shown 
as Q, is called the world line of the particle. The event that “the particle 
has reached P” is presented by the point Op on the world line 2, and is 
called a world point of the particle. 

A particle that does not move at all in a given frame of reference S, 
still moves continuously along its world line, as depicted by the straight 
line %, directed upwards, because the time clock is continuously ticking. 
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Fig. 7.2. World line and light cones. 


The world line of a photon (i.e. a light quantum often represented by the 
symbol 7) propagating along the X?-axis must make an angle of 45° with 
the X°-axis, as illustrated by the straight line. In fact, one can construct, 
at any event Op, a light cone Ap whose surface will make the angle of 45° 
with the X°-axis. 

Figure 7.2(b) presents a better picture of the light cone and its sig- 
nificance. At any event point Op the light cone carves out a region of 
space-time as Future, and another as Past, both of them confined within 
the light cone, the Future occupying the upper part, the Past occupying 
the lower one. 

Standing at the event Op I am entitled to have information of events 
that occurred only inside the “Past” segment of the light cone, everything 
outside remaining beyond my knowledge. In the same way, all future events 
which will originate from Op, i.e. whose world lines will pass through Op 
will lie within the “Future” segment of the light cone. The reason for this 
conclusion is that all information/knowledge is received/gathered through 
messengers that move with velocities that would never exceed the speed 
of light c. The fastest messenger is light, or radio signal, moving with the 
speed c. 

The world line of a photon, marked T, having Op as a world point, must 
be grazing the light cone, passing through two points A and B, lying on the 
Past segment and on the Future segment, respectively. Similarly, the world 
line of a material particle, marked Q — progressing from the past to the 
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future through the event O p — must lie inside the light cone, as illustrated 
in the figure. 


7.2. Hyperbolic World Line of a Particle Moving 
Under a Constant Force 


A relativistic particle moving under a constant force undergoes a constant 
acceleration a with respect to its instantaneous rest frame, as we had found 
out on p. 97. If the force is F, and the rest mass of the particle is m,, then 
a =F/m.. 
Consider a constant force F acting on a particle in the X-direction. 
A good example can be a charged particle placed in a uniform electric field 
E = Eoe,. Let us assume that at t = 0 this particle is instantaneously 
at rest, and located at the origin O, in a certain frame S. Then the (2, ct) 
coordinates of this particle are given in this frame, as functions of the proper 
time T, as (see Eqs. (4.104) and (4.105)) 
o> € [cosh = — 1| S eh a [sinh “ ; (7.1) 
c a é 
The above parametric equation of the world line transforms into the 
familiar equation of a hyperbola, involving only the space and time 


coordinates: 
2 
(a + p)? — (ct)? =p”, where p= ~ = unit length. (7.2) 
a 


We thus get a hyperbolic world line. 

We have used Gnuplot to plot the hyperbolic world line, represented by 
Eq. (7.2) in Fig. 7.3, in which the X°- and the X+-axes are each graduated 
in the scale of p = 1 unit. 

We have highlighted a few important features of the world line in 
Figs. 7.3(a) and 7.3(b). The physical trajectory of the particle is +00 > 
AOB- +o0, ie. a directed straight line merging with the X!-axis, revers- 
ing its direction at O. The particle comes from infinity with velocity ~ —ce,, 
along the X1-axis but in the negative direction, decelerates due to applica- 
tion of the force in the +e,-direction, stops momentarily at the origin O, 
then turns back and returns to infinity with velocity + +ce, along the same 
X-axis, but now in the positive direction. A, O and B are three points on 
this axis reached by the particle at certain times during the inward and 
outward and journey. The corresponding events are O4, Oo and Og. 

In Fig. 7.3(a), we have drawn a single light cone at the event Oo, and 
in Fig. 7.3(b) at each of the three events 04, Og and Og. Note that as the 
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x? x? 


Light Cone|—Future 
1 
4 ‘ | ‘ ‘ 


S21) 12 5 4 6 


Light Cone — Past 


(a) (b) 


Fig. 7.3. World line of a charged particle under a uniform electric field. 


velocity of the particle approaches c, its world line almost grazes the light 
cone, but never goes outside it. 

We shall top up the above exercise with some realistic numerical esti- 
mates. Let us first define a characteristic time 7, such that the particle 
would reach the velocity c in this time, if non-relativistic mechanics had 
been applicable. That is, 


QT =C, OT =C/a. (7.3a) 


a 
Hence, p= 7 oT (7.3b) 


We shall consider a charged particle, e.g. an electron, which is accel- 
erated in a 30m long linear accelerator (e.g. pelletron) to 30MeV. The 
electric field through which the particle is accelerated is assumed to be uni- 
form, and equal to E = 10° V/m. The charge and mass of the electron are 
e=1.6 x 10-'°C, mp = 9.11 x x107%! kg. The acceleration is then 


eE 16x 10-19 x 108 
mo «9.11 x x 10-34 


do that: m= (3x 10") (017% 10) =17:6 % 107" s. 


017 x10" m/e" 
(7.4) 


Hence, p=17.6x 1071? x 3 x 10° = 0.528 m. 
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We go back to Sec. 4.4, copy formulas (4.58) and (4.59) which give the 
velocity and displacement the particle: 


ee 
1+ (#) 
2 (7.5) 
r= p 1+ (=) —l|, 
To 
and make the following estimates: 
1 
at t = T, = 3.416 x 1071", v = —=c=0.707¢, 
V2 
x = (V2 —1)p = 0.414 9; 
—10 4 
at t = 27, = 6.832 x 107s, v= 4/-c=0.89c, 
5 (7.6) 


x = (V5 — 1)p = 1.236 p; 


att =327, = 10.248 x 10-" s, vey! = c=0.95¢, 
x = (V10-— 1)p = 2.162 p. 


It is then seen that at t = 37, = 10.248 1071°s. the particle has traversed 
2.162 units of distance = 1.41 m from the origin, and has gained a speed of 
0.95 c. We have marked this point on the X!-axis with an upward arrow f. 


7.3. Lorentz Transformation in Space-Time 
7.3.1. Graphical procedure 


How to represent Lorentz transformation in space-time? We need the 
answer for a better understanding of Special Relativity. In this section, 
we shall demonstrate Minkowski’s graphical construction of Lorentz trans- 
formation, and use this construction to resolve the paradoxes of length 
contraction and time dilation. 

We have explained the procedure in Fig. 7.4. In order to make the 
drawings less clumsy we have replaced the (X!, X°)-axes with (X, Y)-axes, 
and the (a',2°) coordinates with (a, y) coordinates. We shall explain the 
procedure in two steps. 


Step 1: SET UP THE COORDINATE AXES (X’, Y’), WITH SCALES SHOWING 
TIC MARKS AT UNIT INTERVALS. 


Space-Time and Its Inhabitants 203 


Fig. 7.4. Graphical construction of Lorentz transformation. 


The first part is shown in Fig. 7.4(a). Draw the X — Y-axes (perpen- 
dicular to each other) with the origin at O. The straight line OY’ making 
an angle ¢ with the Y-axis (¢ < 45°) is the new Y’-axis of transformation. 
I is the hyperbola: y? — 2? = 1 and VW is its asymptote: y = a. A is the 
point of intersection of [ with the Y’-axis. 


Y‘-axis: y = [tan(a/2 — ¢)] « = xcot¢, 

Lr: y?—2?=1, (7.7) 
1 

Intersection A: (2,4, y4) = ——==————— (sin ¢, cos ¢). 


cos? @ — sin? 


Now scale the Y’-axis by defining one unit as the intercept OA, equal 
to 


1 
Ay = Va +R = ——— —, (7.8) 
cos? @ — sin* 
to be called scale factor. Now draw the straight line #1, which is tangent 
to the hyperbola at A, interacting the asymptote WU at B. 


d 
tangent: oo) _ ra/Ys = tan ¢, 
A 
#1: Y —Ya = (a —ax,) tang, 
t: | (7.9) 


Intersection B: (@5, ys) = ew 1). 
— sin 
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Now complete the parallelogram OABC. The line OC extended onward 
is the X’-axis. The point C lies at the intersection of the X’-axis and the 
line # 2. 


#2: Y — Ys = (u4— 25) cot, 


/ Tae — 
X-axis: y = xtan@, (7.10) 
1 

2 


cos? @ — sin* @ 


(cos ¢, sin @). 


Intersection C: (ao, Yc) = 


Now scale the X-axis by defining one unit as the intercept OC, equal 
to 


dy = VRE = (7.11) 
cos? ¢ — sin? ¢ 

Note that A,, = A,. Therefore, the two axes have a common scale factor 
N= Aw = dy = 1V14 

where B=tan¢d<1, y= vo (7.12) 


For the actual graphical construction of the LT, we have taken ¢ = 15°. 
This gives 6 = tan @ = 0.2679; y = 1.038; A = 1.074. 


Step 2: TRANSFORM THE (2, y) COORDINATES OF A POINT P TO (2’,y/’), 
USING THE SCALE FACTOR A. 

We have explained the steps in Fig. 7.4(b). Let us write the coordinates 
of the point P as (a9, yo). The straight line © drawn parallel to the Y’-axis 
and passing through P intersects X‘-axis at Q, and the straight line © drawn 
parallel to the X’-axis and passing through P intersects Y’-axis at R. Then 
Q and R are the projections of P on the X’- and Y’-axes, respectively. Let 
us find the (x,y) coordinates of Q and R: 


O: ¥ — Yo = (x — xo) cot ¢, 
X'-axis: y= xrtand, (7.13) 
Intersection Q: (%Q,Yo) = Ay(“Lo — Byo)(cos ¢, sin ¢), 


where we have used the identity 


cos $(1 — tan? ¢) = a (7.14) 
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Similarly, 
2: y — yo = (% — xo) tan g, 
Y-axis: y = x(cot 9), (7.15) 


Intersection R: (ar, Ye) = AY(Yo — Bx0)(sin ¢, cos ¢). 


The coordinates (x, yo) of the point P, with respect to the X', Y’-axes 
are now calculated in the following way: 


L = 0Q = \/22, + y2 = (x0 — Byo); (7.16a) 
K = OR = Vx2 +92 = yo — Ba0), (7.16b) 
ty = L/A = y(x0 — Byo), (7.16c) 
yo = K/A = (yo — Bao). (7.16d) 


Note that we have not used any principle of relativity in the above geo- 
metrical construction. It was a mathematical exercise in coordinate trans- 
formation (x,y) > (a',y’) with the final result: 


y’ = (y — Ba), 1 
1; y= ——. rales 
eo) ere J1— B? en 


This result is identical with the Lorentz transformation formulas derived 
in Eqs. (3.8). In this sense, the construction presented here can be called 
a Graphical Construction of Lorentz Transformation. 


7.3.2. Graphical construction of length contraction 


We shall resolve the paradox of length contraction graphically using space— 
time axes of the S and S’ frames, as depicted in Fig. 7.5. It is a partial 
copy of Fig. 7.4(a) with the following important differences: (1) changed 
the (Y, X)-axes to the (X°, X')-axes of the S' frame, and similarly (Y’, X’) 
to (X°, X"’) of the S’ frame; (2) expanded the axes so that the tic marks 
are further apart. 

However, since we shall depend on the construction shown in Fig. 7.4(a), 
we shall use the same coordinates used there, namely, (y, x); (y’,x’), syn- 
onymously with the relativity coordinates (2°, x!); (a, a1’), respectively. 

A meter stick, i.e. a rigid rod of unit length, is lying along the X"-axis of 
the inertial frame S’ which is moving relative to S with the velocity 6c in the 
X-direction. L and R represent its left and right ends. The world lines of 
these two ends are shown as the straight lines LM and RN. The “world view” 


206 Special Relativity, Tensors, and Energy Tensor 


Path of the meter stick in physical space 


Fig. 7.5. Graphical construction of Lorentz contraction. 


of the rod at 2°” = 0 is shown as a thick solid line of unit length, and at 
four other values of 2° as thick, but broken lines, also of unit length. 

Let us first understand what we mean by the length of the rod in S, 
with respect to which it is moving. We had provided the answer in the 
paragraph following Eq. (2.26) on p. 41. It is the distance between the 
points L and R°, marking the left and the right end of the rod on the X!- 
axis simultaneously, i.e. at same the instant 2° = 0 (same as t = 0), as the 
rod was speeding away along this axis. We have shown the motion of the 
rod in the lower diagram. 

Referring to Fig. 7.4(a), let R° be the intersection of the world line #2 
(i.e. the straight line CB) on the X1-axis. The intercept @, in Fig. 7.5, is 
the x! coordinate of R°, and represents of length of the stick in S$. We shall 
calculate @ using the (vs, ys) coordinates of B from Eq. (7.9). 


#2: Y — Ys = cot d(a — a5), 


X-axis: y=0, 


0 cos @ + sing 
At R®: f= a, — tandys = (1— By) a sing 7 


(7.18) 
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Conclusion: The length of a straight rod which is 1m long in its rest frame 
is measured to be ¢ = 1/7 in the frame S with respect to which it is 
moving with velocity c@ parallel to its length. We have thus derived the 
length contraction formula by graphical construction. 


7.3.3. Graphical construction of time dilation 


We have explained the construction in Fig. 7.6. Again, the axes are the 
same as in Fig. 7.4(a), but the axes have been expanded even further so 
that the tic marks # 1 on the axes appear near the margins. 

We have presented two events O and A, both occurring at the same 
spatial location in S’, or, to be more precise, at the same space coordinate 
gl = 0, but at two different time instants, separated by a time interval of 
one unit in the frame S$’. This means that 2°’ = 0 for O, and 2°’ = 1 for A 
(as defined while writing Eq. (7.8)). In this sense, the proper time between 
O and A is 1 unit. 

We want to find out graphically the time interval between the same 
two events in the frame S. The procedure is very simple. Just find out 


Fig. 7.6. Graphical construction of time dilation. 
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cos 


cos? @— sin? ¢ 


the y coordinate of A, from Eq. (7.7): ys = = y. Hence the 


conclusion: 


Conclusion: If the time interval between two events O and A occurring at 
the same spatial coordinates in a frame S” is 1 unit (so that the proper time 
between the events is one unit), then the time interval between the same 
two events as measured in the frame S which is moving uniformly relative 
to $’ with velocity c3, will be y units. 

Note from the figure that there is a certain spatial separation of @ 
between the two events in S. 


7.3.4. Simultaneity, or absence of it 


On the same diagram presented as Fig. 7.6 we have tried to resolve the 

paradox surrounding simultaneity. Three events O,,0O,,O., shown on the 

X-axis are simultancous in the frame S’. They occur at the same time 

x = 0. Projecting these three events on the X°-axis we find that they 
0 


occur at different time coordinates x9, xp, x2 in the frame S. 


7.4. Minkowski Space-Time 


What is the length of the segment of the world line between two event 
points Oy and Op? 

When I am sleeping, I am still walking a long way in space-time along 
the time axis. Can I say that the distance I have travelled is c times, say 
6 hours of sleep? 

During daytime I have commuted from Mysore to Bangalore, a distance 
of 140 km, in 3 hours. I have moved along some XY-plane, as well as along 
the time axis. Shall I apply the Pythagorean theorem to arrive at a distance 
of \/(3c)? + (140)? km, covered in space-time? 

We shall find an appropriate definition of “length” in space-time. The 
reader may ask, “What is the need for measuring length in an abstract 
four-dimensional world which we cannot even visualize before our eyes?” 
We need a measure of length because we are going to construct four- 
dimensional vectors in space-time. The most elementary such vector is a 
“directed straight line” from an event © to another event ®. Length is the 
only invariant (i.e. something that does not change with a change of the 
coordinate system) associated with a straight line, whether is space-time 
or in physical space. 
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Let us review how length is defined in analytical geometry in terms 
of coordinates. Consider a straight rod whose endpoints A and B are at 
the coordinates (x1, 41,21) and (#2, y2, 22) with reference to a Cartesian 
frame S. The length of the rod is given by the Pythagorean expression 

2 — (xo x1)” t (yo yi)? t (29 zi)*. (7.19) 
If we look at the rod from another angle, say, by shifting my telescope, or 
better still by rotating my frame of reference, I shall still measure the same 
length @ of the rod. 

Suppose we perform a rotation of the axes by an angle @ about the 
Z-axis, to obtain a new frame 9’. It is an elementary exercise? to show that 
the above rotation causes a transformation of coordinates from (a, y, z) to 
(a’, y’, 2’), given by 


x’ =xcos0 + ysin9, 
y =—axsin@ + ycos6, (7.20) 
2 =z. 
Therefore, if (2, y{, 24) and (a4, y4, 25) be the Cartesian coordinates of 
A and B in S$’, the transformed length ¢’ of the rod in S’ will be 
e? = (ay — 24)" + (yp — i)? + (2% -— 24)”. (7.21) 
Applying the transformation (7.20) we get 
? = {(a2 — 21) cos 6 + (y2 — yi) sind}" 
+ {—(x2 — x1) sin@ + (y2 — y1) cos 6}? + (z2 — 21) 
2 


2 


= (a2 — 21)" + (yo — 1)” + (22 — 21) 

= 7. (7.22) 

In summary, the length expression as given in Eq. (7.19) remains invari- 

ant in our familiar physical space, under all transformations of coordinates 

due to rotation. Since Euclid’s geometry is valid in this space, we shall 

often refer to this space as the Fuclidean space and denote it by the symbol 

E®. The square of the distance dé between two neighbouring points (2, y, 2) 
and (x + dz, y + dy, z + dz) in E® is given by the expression 

dl? = da? + dy? + dz. (7.23) 

We shall call Equation (7.23) an expression for the line element in E°. 

A formal name for the above expression is metric. Equation (7.23) expresses 

an Euclidean metric. 


>The transformation of coordinates by rotation about the Z-axis [23]. 
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We cannot expect invariance of the Euclidean metric (7.23) under a 
Lorentz transformation. This is because the length of a stick appears dif- 
ferent to two different observers who are moving relative to each other, so 
that ¢’ # £. On the other hand, if we consider two events 0 and ® with 
coordinates (ct1, 21, 41,71) and (ctz, x2, y2, 22) in some Lorentz frame S, 
then the value of the expression 


s=cts—hy-f 


= "(tg — t1)? — [(we — 21)? + (ye — 1)? + (22 -— 21)" (7.24) 


remains invariant under a Lorentz transformation (cf. Eq. (3.11)). There- 
fore, if we are looking for a candidate to represent “length” in an invariant 
way in space-time, then the expression give in Eq. (7.24) should satisfy the 
requirement. 

The letter s used above is the invariant “length” in relativity. However, 
since the word itself connotes a measure involving a meter-stick — as that 
of a reel of yarn or the width of a fabric — and since we cannot stretch a 
meter stick across space-time, it is better to suggest an alternative name, 
for which we choose “line interval”. 

The differential line interval between two infinitely close events having 
coordinates (ct,x,y,z) and (ct + cdt,« + dx,y + dy, z + dz) has a greater 
relevance in the geometry of space-time. We shall write this as 


ds” = c’ dt? — da? — dy — dz’. (7.25) 


Equation (7.25) will represent the line element, or the metric, better known 
as the Minkowski metric, for space-time. Any LT from a frame S to another 
frame $” will guarantee invariance of this metric: 


ds* = edt? — dz? — dy? — dz" = c* dt’? — de’? — dy? — dz". (7.26) 


This is a consequence of the definition of LT as written following Eq. (3.11). 

When the reader will take up study of the General Theory of Relativity 
he/she will realize that it is not possible to obtain the simple metric of 
Eq. (7.25) in the presence of a true gravitational field. Massive gravitating 
objects, like pulsars, quasars, black holes, distort the geometry of space— 
time grossly thereby invalidating the Pythagorean expression (7.23) for the 
length of a “straight line”. However, for our study of Special Relativity, 
from which gravity is excluded, we shall always assume the Minkowski 
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metric of Eq. (7.25). The space-time whose geometry is described by the 
Mankowski metric is called Minkowski space-time, and will be denoted by 
the symbol M*. 

The reader may have already noticed that the right-hand side of 
Eqs. (7.24) and (7.25) are not necessarily positive. In fact, they can be pos- 
itive, negative, even zero. When the events 0, ® for which we have written 
the line interval (7.24) are on the world line of a material particle, s? > 0, 
because all material particles move slower than light. In that case, we say 
that the line interval between these events, or the line element joining these 
events, is time-like. If s? < 0, the interval is space-like. On the other hand, 
if s? = 0, the interval is light-like. (We have already defined these terms in 
Sec. 3.4.) 

Suppose a star, which is 10 light years away, has “exploded today” into 
a supernovae. Call this event 0. We shall see this supernovae 10 years later 
in the form of a brilliant flash of light. Let this “seeing at a certain observa- 
tory” on earth be called the event ®. Then the line interval between these 
two events is zero, according to Eq. (7.24), even though the intervening 
“distance” between the events is 10 x 365 x 24 x 60 x 60 x 3 x 10° km!! 

In Sec. 7.8, we shall characterize a 4-vector as time-like, space-like or 
light-like depending on whether the square of its length is greater than, less 
than, or equal to zero. 


7.5. 3-Vectors, Contravariant and Covariant Families 


We shall be using vectors and tensors in E* as well as M*. The new 
species of vectors we shall be using in M* will be called 4-vectors, a term 
coined by Minkowski. To distinguish our familiar vectors, used so far in 
non-relativistic physics, we may refer to them as 3-vectors. 

It is easier to build the concept of vectors in E? and then extend the 
same to M*. A vector in E® is a directed straight line segment with the addi- 
tional property when two of them, A and B are added to obtain their sum 
C=A+B, this operation is carried out by constructing a vector triangle. 

The first one of these above two properties gives a geometrical character 
to a vector. If we have chosen a frame of reference, then the length and 
orientation of a vector V will determine its components V;,V,, Vz along 
the X-, Y-, Z-axes, which are obtained by projecting the vector along 
these axes. 

Conversely, given the components V,, Vy, Vz of a vector V with refer- 
ence to a frame of reference, we can construct the vector V in space as 
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a geometrical straight line. Therefore, it is often a practice to express a 
vector as an ordered triple of real numbers, which can be written either as 
a column, or as a row matrix. 


Vi 
V=|V, |; alternatively, V = (Vz, Vy, Vz). (7.27) 
Vz 


In view of the above matrix representation, the triangle rule of vector 
addition is equivalent to a matrix addition: 


Az a By 
A+B=| A,+B, |; 
A&B, (7.28) 


alternatively, A+B=(A,+B,,A,+ By, A+ Bz). 

The most elementary vector, which also serves as the patriarch of one 
family of vectors known as contravariant vectors is the infinitesimal dis- 
placement from a point P(z,y,z) to a neighbouring point Q(# + dz, y + 
dy, z + dz), so that 

dr = (dx, dy, dz). (7.29) 


When we change over from a frame of reference S to another one $”, which 
is rotated with respect to the first one, the coordinates get transformed 
according to Eq. (7.20). As a consequence, the vector dr changes its com- 
ponents from (dz, dy, dz) to (dax’, dy’, dz’), where, 

dx’ = dacos@ + dysin 6, 


dy’ = —dx sin @ + dy cos0, (7.30) 
dz' = dz. 
Starting from the primary vector dr one obtains, through multiplication 


with scalars (e.g. 1/dt, m), and differentiation with respect to time t, other 
members of the contravariant family are obtained, e.g. 


1 dr ene 
velocity Laake: x dr Ti (multiplication), 
dv ' ie 
acceleration a= — (differentiation), 
dt (7.31) 
momentum p=mxv=mv — (multiplication), 
force F= ue (differentiation), 
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and so on, all of which share the transformation of dr. One can now define 
a set of three real numbers V = (V,,V,, Vz) to constitute a contravariant 
vector, if they transform under a change of coordinates exactly like the 
components of dr. The rotation of the vector V will change its components 
from (V;,Vy, Vz) to (Vz, Vy, Vz), such that 

Vi = V, cosé + Vy sin 9, 


V, = —Vze sin 6 + Vy cos 6, (7.32) 
V, = Ve. 


To be more precise and general, we replace the particularly simple trans- 
formation formula (7.20) to 


a = fe" 2.2) = f* (x), k = 1, 2,3, (7.33) 


where we have written (x) to mean all the old coordinates (21, 2, 73). Note 
that we have used superscripts (1,2,3) on a to mean «x,y,z, respectively. 
A superscript will be called a contravariant index. We shall follow this 
practice in the remaining part of this book. 

Equation (7.33) represent a general change-over from the old coordinates 
(x) to (x) — most common examples being (1) Cartesian to Cartesian 
due to rotation of axes, just cited, (2) Cartesian (x,y,z) to the spherical 
coordinate system (7,6, @), or vice versa.© 

The coordinate differentials (dz’) in the old system, and (dz) in the new 
system, are connected by the linear expression 


3 3 
Or . 
tk ) _ . k = 

dx = Oui => RR jaa’, k = 1, 2; 3, (7.34) 

pat j=1 

where the coefficients 

. Og” of 
R= = SS, 8g = 1,93 7.35 
om) Oxi Oxi 2 oJ ? ? ? ( ) 


constitute a 3 x 3 matrix R. They are, in general, functions of the coordi- 
nates (x). For the special case of rotation, these coefficients are independent 
of the coordinates and constitute an orthogonal matrix R. An orthogonal 


“One has to be careful before applying this transformation rule to curvilinear coor- 
dinates. A distinction between the “physical components” and tensor components has 
to be made, bringing in the scale factors before making any serious application, say, in 
mechanics. 
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matrix is one for which its inverse equals its transpose, as we have explained 
in Eq. (7.45) below. 

Note the dot “.” followed by a small space in the subscripts, to imply 
the second position for the subscript, i.e. its role as the column index. The 
superscript k and the subscript 7 in Re constitute the row index (first 
index) and the column index (second index), respectively. 

For the case of the rotation with transformation represented in 
Eqs. (7.30) and (7.32), R1, = cos0; R!, = sin0;.... We write the com- 
plete matrix: 


cos@ sind O 
R=|-sind cosd 0]. (7.36) 
0 0 1 


Though simple, the above matrix is used to construct the most complex 
rotation matrix of a rigid body, consisting of precession, spin and nutation.¢ 

In order to avoid writing the summation symbol © repeatedly — for 
which occasions will arise copiously in the sequel — it has been a common 
practice among relativists to adopt the Einstein Summation Convention. 
For this purpose, we introduce a contravariant index as the one which 
appears either as a superscript in the numerator, or as a subscript in 
the denominator. Its opposite is the covariant index, which, therefore, 
appears either as a subscript in the numerator or as a superscript in the 
denominator. In the expression (7.34), for instance, k appears as a con- 
travariant index on either side of the equation, whereas j appears first as a 
covariant index, and then as a contravariant index. 

Einstein convention proposes that if in a certain term of an expression 
the same index appears twice, once in the contravariant form and another 
time in the covariant form, then that index is to be interpreted as a sum- 
mation index, and the formal summation sign © is to be dropped. 

Adopting this convention we rewrite Eq. (7.34) as (compare this with 
the convention adopted on page 147) 


dx!* = a J—R¥ di, k=1,2,3 (7.37) 
—- Ovi an ey) $ ee gaa g Se “ 
A summation index, e.g. j in Eq. (7.37), is a dummy index. A dummy 


index can be replaced by some other dummy index. For example 


da” =A dat = Rae, (7.38) 


Goldstein, op. cit., p. 147ff. 
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We shall complete the suggestion following Eq. (7.31) to define a con- 
travariant vector A to be an ordered triple of real numbers (A”) that trans- 
form under a coordinate change (2) to (2’) of the form (7.33) to another 
triple (A’’), such that 


Ox!* 


k _ pk j iS 
A’ = YAS, where Rr; = Bai’ 


(7.39) 
with k, 7 = 1, 2,3, are the components of a 3 x 3 matrix R. 

Note that we have indicated the components of the contravariant vector 
with a superscript. 

Equation (7.32) is a special case of (7.39), in which the components Rk, 
of R are constants, i.e. independent of the coordinates. This happens in 
the case of Cartesian to Cartesian transformation, like rotation of the axes. 

Another class of vectors extensively used in physics is the covariant 
family. Their prototype, the gradient of a scalar field w(a, y, z) is denoted 
as Vw. Let us write 


G=Vy, 
an (7.40) 
having components Ge=z Zz, k=1,2,3. 
Ox 


Under the transformation (7.33), these components will change to Gyr = 
2» in the system (x), Using the chain rule of calculus, one then gets 


Ox!k 
Ow Ow Ox . 
Qo =—o = = GM! 
ke 4 tk J Kk? 
Ox | Ox) Ox (7.41) 
Jj 
where M?, = — 


are the components of a 3 x 3 matrix M. 
Let P = MR be the product of the two matrices M and R. Then 


g On Ox” -Ox? j 
P; == ST = 5y, 
Ox'* Ox® — Oa® (7.42) 
or MR=1= identity matrix. 
Equation (7.42) shows that the matrix M is identical with R~!. We 
can therefore rewrite the transformation equation (7.41) for the gradient 
vector as 


Gt, = G;(R71)i. (7.43) 
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Therefore, we define a covariant vector V as an ordered triple of real 
numbers (V;,,) that transform like Vw under a change of coordinates (2) to 
(x’). This means that the transformed components of the vector will be: 


Vi = Vj(BNY. (7.44) 


Note that we have indicated the components of the covariant vector 
with a subscript. 

It is then clear that the contravariant vectors and the covariant vectors 
have distinctively different characters. Isn’t it then strange that in physics 
so far we have treated them alike and represented both by directed straight 
lines? 

The reason is that the formal difference between Eqs. (7.39) and (7.44) 
disappears when we consider transformation from one Cartesian system 
S to another Cartesian system S’ due to a rotation of the axes. The 
transformation matrix R is then an orthogonal matrix, as stated follow- 
ing Eq. (7.35). By its definition, the inverse of an orthogonal matrix is its 
transpose. That is, 


7-1 trans 
O-=0 ‘ 


(7.45) 
or (0-7), =O, 
Using this property, we can rewrite (7.44) as 
3 
y=) RLY: (7.46) 
k=1 


The above transformation rule is identical with (7.39). It is for this reason 
that the discriminatory labels “contravariant” and “covariant” are unnec- 
essary in classical physics. 

When the components of a 3-vector V change from {V,} to {Vj}, 
through an orthogonal transformation, its length, by which we mean its 
magnitude, does not change, i.e. 


By 3 
Snr =r (7.47) 
k=1 j=l 


as exemplified in Eq. (7.22). We shall adopt Eq. (7.47) for defining a 
3-vector. 
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7.6. 3-Tensors, Contravariant and Covariant Families 


A vector is a geometrical object that does not change with a coordinate 
transformation, like a hexahedron which remains the same geometrical 
object no matter from which angle you look at it. This means that if under 
a coordinate transformation (x) to (a) the scalar components of a vector 
V change from (V;,, Vy, Vz) to (Vz, V,, Vz), then the unit vectors (e,, ey, ez) 


/ / 
ye) such that 


will change to (e/,,e 
Vrexz + Vyey + Vzez = Viel. + Vie, + Viel. 
Sameas, Ve; =V’el. 


However, V’* = R*,V!, from Eq. (7.39). 


Vi =e! Rk. yi 
Hence, ejVJ =e, RV’, 
or e; =e, R*,, 

-1)\I g.— oo! Rk (p-1)\I — al sk —ol 
or (R ype) =e, RE (KR \ig = &,0%¢ = 


Note that (R~1)’, represents the j component of the matrix R~!, the 
inverse of R. Hence the theorem. 


Theorem 7.1. Let R represent the transformation matrix corresponding 
to a change of coordinates (x) + (a'), so that a contravariant vector A 
changes its components from (A*) to (A) by the rule given by Eq. (7.39). 


Then, the transformation of the unit vectors (e;) — (e}) will be given by 


the rule: 
e} =e, (R-')’,, (7.48a) 
and the converse, 
e, = eR’). (7.48b) 


We shall obtain the transformation formula for the contravariant 3- 
tensor T’* using Eq. (7.48)(b). 


Let T=T**e,e, = Tele}. 
Now, Tee, =T* {el Ri, }{e,R’,} = {Ri RY,T* yee. 
Hence, 


Te a RT, (7.49) 
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Going back to Eq. (7.39) we draw the following conclusions: 


e A scalar is a real number that does not change under a coordinate trans- 
formation. 

e A vector is a row or column matrix of three real numbers that transforms 
through a single application of R as shown in (7.39). 

e A tensor is a 3 x 3 square matrix that transforms through a double 
application of R as shown in (7.49). 


We shall illustrate the concepts outlined above by citing the example 
of rotation about the Z-axis for which we had written the transformation 
matrix R in Eq. (7.36). Its inverse R~! is its transpose. We shall write R 
and R-! side by side for contrast: 


cos@ sind O cos@ —sin@ 0 
R= 1|-sin@ cos6 0}. R'=| sind cosd O|. (7.50) 
0 0 i} 0 0 1 


The base vectors should now transform as 


so that, ef =e,(R7')!, + e.(R-!)?, = cos de, + sin deg, 
1=e1(R“*) teh )*, 1 2 (7.51) 


es =e;(R71)!, + e2(R1)24 = — sine + cos fea, 


Comparing Eq. (7.51) with Eq. (7.32), we find that the vector components 
(V;,V,,Vz) are the same linear combinations of (V;,,Vy,Vz), as the unit 
vectors (e/,, e5,e5) are of (e1, e2,e3), even though they follow different rules 
of transformation. This is because in the first case the matrix R appears 
before, and in the second case the matrix R-1 appears after, the quantities 


being transformed. Let us highlight this. 


yl" cos@ sin# 0 vi 
Vv? | =] —sin#@ cosé 0 Vy? 
ye 0 “il y3 
Vicos6+V? sind 
=| —-V'sin6+V?cosé |, (7.52) 


y3 
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cos? —sin@ 0 
(ee, es) = (e1 e2e3) | sin®d cosd 0 
0 0 1 


= (cosfe; + sinfez, —sinfe; + cosfez, ez). (7.53) 


We shall now obtain the transformed components of a contravariant 
tensor T under the same rotation of the axes (about the Z-axis) for which 
the transformation matrix R is given in (7.50). We shall illustrate the pro- 
cedure for only one component, namely, T”!?. 


r= Ri Rers 
= RI RYT™ + RY RT” + Ry R?3T + Ri,RT™ 
+ RI,R%,T” + RigR?,T” + RigRT* + RigR?9T* 
+ Rye 
= (cos 6)(— sin 0) T1* + (cos) (cos 0) T? 
+ (sin @)(— sin 8) T? + (sin 4) (cos 0) T?”. (7.54) 


The reader should complete the work by working out the transformation of 
the other eight components. Here are all the transformed components: 


T! = T" cos? 9 + (T!? + T?*) cos sind + T?? sin? 9, 
Tl? — T™ cos? é + (T™ — T™) cos @sind — T™ sin’ 8, 
T'S —T cosd + T*? sin 8, 
T”) = T"' cos? 6 + (T” — T") cosOsind — T” sin” 6, 


T'? — T” cos? 6 — (T? +T™) cos@sind +T™ sin? 4, (7.55) 
TT’? = T cos6 —T sin 8, 
T! = T*! cos6 + T* sin 0, 
T'? — T*? cos6 — T*! sin 8, 
133 _ p33 
A contravariant 3-tensor like T can find its companion in the covariant 
3-tensor P such that P - T is a 3-scalar. The transformation rules (7.71), 


(7.75) and (7.49) would suggest that the components of P should transform 
{Pmn > Peis h as 
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Pri = ye Ve (7.56) 


Equations (7.49) and (7.56) constitute the transformation rules for a 
contravariant and a covariant 3-tensor. 

We shall show that the above transformation rule will ensure invariance 
of P- T under the transformation R. 


Proof. 
Pp 


mnt” = {Pie( RY (Ro) nA HRG RL} 


Sh Vg Cn eee 
a0 fal = Pl™. (7.57) 
(QED) 


There is one tensor which is not related to any physical quantity in 
physics. It is the metric tensor, which we shall denote as g, and its 3 x 3 
components as {g;;; 1,7 = 1,2,3}. In Relativity, particularly in the General 
Theory of Relativity, g plays the most dominant role, being synonymous 
with the gravitational field itself.° The metric tensor defines the geome- 
try of space, or space-time, through an expression of the line element. 
For a Cartesian coordinate system in H?, the expression of the line ele- 
ment is written in Eq. (7.23). For a spherical coordinate system (1,9, ¢), 


{z' =r; x? =96, 23 = o} and the same line element is written as 
dé? = dr? + r°d6? + r? sin? 0. (7.58) 
The metric tensor’s role in EF? is to write all line elements as 
dl? = gijdx'dx. (7.59) 
It is then seen from (7.23) and (7.58) that 
1 0 0 
g=G°" =5;;=| 0 1 0]; Cartesian coordinate system, 
0 0 1 
(7.60) 
1 0 0 
g=G°™ =| 0 (x)? 0 ; spherical coordinate system 
0 0 (a sine? )? 


°“Metric as the foundation of all”— p. 304 of Ref. [5] 
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Why do we call g, whose components are defined by Eq. (7.59), and 
exemplified by (7.60) a tensor? The reason lies in the Quotient Theorem 
stated in Sec. 7.9.4. The left-hand side of Eq. (7.59) is a scalar, whereas 
dx'dx) is a contravariant tensor of rank 2 (having two superscripts). There- 
fore, gi; should stand for a covariant tensor of rank 2 (i.e. with two sub- 
scripts). 

We shall demonstrate the above tensor character of g by showing that 
under an orthogonal transformation (e.g. rotation of coordinate axes), the 


transformation rule (7.56) will convert g(") (same as 6;;) to itself. 


Proof. 
= 6ij(R')m(B on = (RV m(R Yn 
= RR), = binn- (7.61) 
(QED) 
We shall further demonstrate the tensor character of g by showing that 


when the coordinate system changes from Cartesian to spherical, the same 
transformation rule will convert 9") to gP"). 


7.7. Transformation of the Metric 3-Tensor from 
Cartesian to Spherical 


As in Eq. (7.61), 


Imm = Gen” = Sy Ro) (RY, = (BY (BOY a: (7.62) 


ae 
The transformation from (z) to (x’) is given in Eq. (5.35). (R71), = 2a. 


Therefore, (R~1)!, = 22 = sin@cos¢; (R71)!, = 9% = rcos@cos¢, etc. 


We therefore have the following (inverse) transformation matrix: 


sin@dcos@ rcos@cos@ —rsinésind 
R-'=| sinOsing rcos@sing rsin@cosd |. (7.63) 


cos 6 —rsind 0 
It then follows from (7.62) that 
RP” = (RYAP + (YAP + (ROP = 1 


In this way, the other components of g@P") as given in (7.60) can be 
obtained. 
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In writing equations of physics that respect relativity, g remains in the 
background, changing indices — contravariant to covariant and vice versa, 
and in the process helps us write the laws and principles of physics in the 
relativistic, covariant language. 


7.8. 4-Vectors in Relativity 


The simplest 4-vector, by which we shall mean a vector in M‘, that comes to 
one’s mind is a directed straight line segment or stretching from some event 
© (which could have occurred yesterday) to some other event Y which might 
have occurred in the past, or may occur tomorrow. Since such a vector 
spans time as well, we cannot represent it with an earthly rod. 4-vectors 
cannot be “pictured”. We can only write them on a piece of paper as an 
ordered quadruple of four real numbers, e.g. 


OY = (52°, 621, da”, 5x°), (7.64) 


or draw a space-time diagram using 4 axes cT, X, Y, Z with one of the axes, 
e.g. the Z-axis suppressed, and show it as a geometrical straight line, with 
an arrowhead showing its direction from 0 to WV. 

Note that, the last three numbers, i.e. the ordered triple (521, 6x, dx), 
constitute a 3-vector dr, stretching from the spatial location of © to the 
spatial location of VW. We shall often find it convenient to write a 4-vector 
as a 1+3-component object, e.g. (62°, dr). 

The primordial 4-vector, or, the most elementary 4-vector, the ances- 
tor of all 4-vectors of the contravariant 4-vector family, is the infinitesi- 
mal displacement 4-vector d¥ in M 4 stretching from an event © to an 
infinitesimally close-by-event V, the time-space separation between them 
being (dx°, dx!,dx?,dx*). Depending on our convenience, we shall write 
this vector, and its progenies, in four different ways (cf. Sec. 7.1): 


d¥ = (dx) = (dx®, dx', dx?, dx*) = (de®, dr). (7.65) 


Note that in the last equation we have clubbed the three spatial compo- 
nents of the elementary 4-vector 6Y as dr = (dx', dx?, dx) = (da, dy, dz). 
These spatial components, isolated out under the banner dr, constitute a 
3-vector, the object of our discussion in Sec. 7.5. We shall give a better 
definition of 3-vector below Eq. (7.71). 


Space-Time and Its Inhabitants 223 


Under a general coordinate transformation (a) to (a’"), u = 0,1, 2,3, 
a’! = f* (2), (7.66) 


the coordinate differentials (dx) will transform to (dx’), according to the 
rule 


Ox! 
da! = de” =08,de”, 2 = 0,1,2,3, (7.67) 
where the coefficients 
Oa!!! 
==, p,v=0,1,2,3, (7.68) 
i Ox” 


constitute a 4 x 4 matrix 2. In the case of Lorentz transformation, the 
transformation coefficients are constants, as in the case of rotation in E°, 
cf. Eq. (7.36). It is the same LT matrix written as Eq. (3.14) in Sec. 3.2, 
and copied into Eq. (7.69) below, in which we have also spelt out the time 
index 0, and space indices 1, 2, 3 for the four rows (arranged vertically on 
the left) and the four columns (arranged horizontally at the top): 


0 i 2.3 
0 ey 
QG=1|-78B y O00 (7.69) 
2 0 0 10 
3 0 0 oO1 


By analogy with Eq. (7.39), we now define a contravariant 4-vector to 
be a vector 


A = (AM) = (4°, A), (7.70) 
whose components transform from (a) to (a) according to the rule: 
Al =O, AY, w=0,1,2,3. (7.71) 


Note that A° is the time component, and A is the space component 
(or, rather the three spatial components taken together) of the 4-vector A. 
We shall treat A as a 3-vector, as mentioned below Eq. (7.65). Confining 
ourselves to Cartesian systems (so that we do not have to invoke metric 
tensor) a 3-vector A possesses the property of invariance under an orthog- 
onal transformation (e.g. rotation of the coordinate axes, but not boost), 
as per the definition given below Eq. (7.47). 
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We shall now come to the four-dimensional covariant vector. Imagine 
the gradient vector Vw upgraded from E* to M+. We get the 4-gradient 
of a 4-scalar field Vu. 

A 4-scalar field U(x) as a single component field, i.e. represented by 
a single function of the four coordinate (7), such that the value of the 
function does not change at any event point “O” under a change of the 
coordinates from (x) to (x’) due to a transformation of the type (7.66). In 
other words, if Y(a) > W’(x’) under the above coordinate transformation, 
then W’(2’) = W(z). 

Now we define the covariant 4-gradient of such a scalar field VU as the 
4-component field: 


G=Vyu, 
: Ow 
having components G, = V,U= ane = 0,1, 2,3, (7.72) 
a low OW OW OW 
- ge a oS) 


Note that we have used a leftward arrow to imply a covariant 4-vector, 
and a subscript to indicate its components, in contrast with the contravari- 
ant 4-vector (in Eq. (7.70) for which we have used a rightward arrow, and 
superscript to indicate its components. oe 

Under the coordinate transformation (7.66), the 4-vector G changes to 


<— 
G’, and their components change: 


aw’ OW ax” 


lal ee _ 
Vil a VW = aah Bae Doe? = 0,1, 2,3. (7.73) 
Analogous to Eq. (7.42), we have here 
Ox” Ox! Ox” ‘ 
(22%) (20) 90" me 
Going back to Eq. (7.68) we now identify (3) as the Q/,, component 


of the matrix Gq, and now using (7.74), ($5 ) as the (Q7")”,, component 


Og"! 


of the inverse matrix (yt. 
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We therefore define a covariant 4-vector B = B,, = (Bo, Bi, Bo, Bs) to 
be an ordered quadruple of real numbers that transform under the coordi- 
nate transformation (7.66) as 

B= Bala Fs (i) 
The covariant gradient G = Vu obviously satisfies this requirement. 

As a concrete example we shall write down the Lorentz transforma- 
tion equations for a contravariant 4-vector A and a covariant 4-vector 
corresponding to a boost cBe; in the X!-direction. The relevant matrix 
has already been given in Eq. (7.69). Its inverse can be determined either 
rigorously using matrix algebra, or by a short-cut, replacing 8 with —(. 


0 1 2 83 

0 y 7B 0 0 
Q!i=~1)78 yy O 0 (7.76) 

2 0 O 1 0 

3 0 0 01 


Using the matrices (7.69) and (7.76) in the transformation formulas 
(7.71) and (7.75), respectively, we get the transformed components of these 
two 4-vectors: 


A’l = 7(A! — BA), (7.77a) 
A” = A’, 
AB = A? 


Bo — (Bo Tv BB), 


By = y(Bi + BBo), (7.77b) 
Bb = Ba, 
Bi = Bs 


For future use we shall also write the LT formula for the contravariant 
vector A corresponding to a general boost: [S(c@)5S’] along any arbitrary 
direction n. That is 8 = Gn. For this purpose, we adopt the formula (3.18) 
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given in Sec. 3.2, and replace ct with A° and r with A. The result is [24] 


A” = 4(A° — B-A) (7.78a) 
= A°+[(y-1)A° - 76 - Al, (7.78b) 
A’ = (yA, + A1)—7BA° (7.78c) 
=A+|7*(6-A)B ~ 784° (7.784) 
=A-+n{(y—-1)(m- A) - yA". (7.78¢) 


The corresponding Lorentz transformation matrix is the same as Eq. (3.20). 
The inverse of the above transformation, corresponding to the boost: 
[S’(—)S], is obtained by replacing @ with —G@ (same as replacing n with 
—n) and (A°, A) with (A”, A’): 


A® = 7(A% + B-A’) (7.79a) 
= A+ [(y- 1A? +78- A), (7-19) 
A = (yA, +A.) +78A4° (7.79c) 
= aly [25 *(8-A')B +> BA” (7.794) 
= A! +n{(y—1)(n- A’) +78A”]. (7.79e) 


As for a graphical representation of vectors in M+, we shall continue to 
rely on the straight line picture for the contravariant vectors. The covariant 
vectors can be represented by a succession of parallel planes. The orienta- 
tion of the planes (i.e. the normal drawn on them) will indicate the “direc- 
tion” and the spacing the magnitude. The closer the spacing, the larger 
the magnitude. 


7.9. 4-Tensors in Relativity 
7.9.1. Contravariant, covariant and mized tensors 


Tensors of relativity will be defined as matrices subject to specific transfor- 
mation rules. A 4-scalar is a tensor of rank 0. A contravariant vector and 
a covariant vector are both tensors of rank 1 and can be represented as a 
4x1 or 1x4 (ie. column or row) matrices. A tensor of rank 2 is a 4 x 4 
matrix. 
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By an extension of the defining equation (7.49), we shall call the ele- 
ments of a 4 x 4 matrix (T”) as constituting a Contravariant Tensor of 
rank 2, if under a coordinate transformation of the form (7.33), they change 
into (T’“”) according to the following rule: 


Pee = OF Orr, (7.80) 


Similarly, by an extension of the defining equation (7.56), a 4 x 4 matrix 
(Ky) which transforms into (Kj,,,) according to the rule: 


Kw > Kyy = Kap(0™)%, (0-7)? (7.81) 


Vv 


will constitute a Covariant Tensor of rank 2. 
On the other hand, the matrix (Q”“,) which transforms into (Q’) 
according to the rule: 


Oe ard Oe = 0K, Q% Co mea (7.82) 


will constitute a Mixed Tensor of rank 2, having contravariant character 
with respect to the first index, and covariant character with respect to the 
second one. Note that Q", and Q:/ are not the same tensor. 

By obvious generalization, one can extend the definition to any 4!!! x 
4] x ... x 4l"] matrix and call it a tensor of rank n by ascribing to it 
the necessary transformation properties. For example, the 4 x 4 x 4 matrix 
(AM 


. ie) 
followed by two covariant indices, if 


will represent a tensor of rank 3, in which a contravariant index is 


Ab yg > Alby = Mg A%py (Oh (O77. (7.83) 


As an illustration, we shall obtain the transformation equations for a 
contravariant and a covariant tensor of rank 2, corresponding to the boost: 
S(6,0,0)S’. 

First the contravariant tensor. We shall illustrate the procedure by 
obtaining the transformed component T’°°. The relevant LT matrix Q is 
shown in Eq. (7.69): 

00 _ G9 Q0 ma 
=a 
= Mg T? + OP4Q?, TO? + MD OPOT* + 02,09, TE 
= =p ee. (7.84) 


Note that we avoided writing zero terms, like 0°,0°,7", etc. 
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Next, the covariant tensor. We shall demonstrate the procedure for the 
transformed component Kj . The relevant inverse LT matrix is shown in 
Eq. (7.76). Hence, 


Koo = Kap(Q71)%(271)% = 77 [Koo + B(Ko1 + Kio) + 6? Ku]. (7.85) 


We shall now write all the transformed components corresponding to 
the boost: $(6,0,0)S’. 


100 = (Te Ar Tt) Aer |, 102 = aes = pr), 
Ti01 = qtr prs pu) ir, 7120 = A Ha = or), 
T110 — ei er J as) or, T1038 — cr _ pr). 
jn — aatpu pr T's) B-T™|, 7130 — “(Tr = pr), (7.86) 
qi2 — (TY? — pr), Til _— 4(T24 pr), 
T’!3 = 4(T3 — BT), T! = (T3! — BT), 


T!22 = [22 T7123 = Ts. T7132 = T°; T7133 = Sale 


( ] 

Ko, = 77(Ko1 + B(Koo + Kir) + 6? Ko], Bao = = 7(Ko0 + BKa1 
(Koo + Ku1) + 6? Koi] 

Ki, = 7° [Kiur + B(Ko1 + Kio) + | 

Kg = ¥(Ki2 + BKoo), Kd, = y(Kai + BK20), 
Ki3 = y(Kis + 6Koa), Kk}, = y(Kai + 8K30), 
Kon = Ko, Kb, = Ko3, Kho = Kao, K43 = B33. 


(7.87) 


7.9.2. Equality of two tensors 


If the components of two tensors A and B are equal in a preferred coordi- 
nate system, say S’, then their components are equal in any general coor- 
dinate system S', and we say that A=B. 

We shall prove this theorem assuming that A, B are contravariant ten- 
sors of rank two. The two tensors being equal in $”, the components of 
the tensor N = A — B must the zero in $’. That is N’“” = 0. To get its 
components N#” in S, we apply the inverse of the transformation given in 
(7.80): 


NOP aN (ye al (7.88) 
(QED) 
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7.9.3. The metric tensor of the Minkowski space-time 


We had introduced the concept of metric, and metric tensor, in Sec. 7.4, 
through Eqs. (7.23), (7.25), and again in Sec. 7.6, through Eq. (7.59), and 
casually mentioned the pivotal role it plays in relativity, especially General 
Relativity. The metric tensor g,, of Minkowski space-time is defined as 


ds* = g,, dx" dx”. (7.89) 


The Minkowski metric was written in (7.25), which we rewrite in our new 
index format as 


ds* = (dx”)* — (dz)? — (dz)? — (dz). (7.90) 


Comparing the two equations, we get the Minkowski metric as the fol- 
lowing 4 x 4 matrix: 


0 12 3 
0/1 0 0 0 
tg =) |2al. oo . (7.91) 
2/0 0-1 O 
3 \0 0 0-1 


We shall now establish the tensor character of g,,,._ We cannot follow the 
path of (7.61), because in this case the transformation is not an orthogonal 
one. We shall demand that under the transformation (7) > («’) the metric 
must remain invariant. That is, 


ds” = gy, da" da” = Jog dx” dx’? , (7.92) 
The right-hand side is 
Fag (Q", dx") (Q", dx”), 
leading to the transformation rule 
Inv = Gopi. Qs; (7.93a) 
and its inverse: gig = GO RO (7.93b) 


Equation (7.93b) conforms to the transformation rule for a covariant tensor, 
as defined in Eq. (7.81). Hence, g,, is a covariant tensor of rank 2. 


(QED) 
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If the transformation (2) — (a’) is an LT, so that the changeover is 
from one Cartesian system to another, then as per (7.92) 


ds? = (aa)? _ (da’)? _ (da’”)? _ (dx’?)?, (7.94) 
so that 
1 O 0 0 
no 0 -1 O _ (7.95) 
Jap = 6 0 =1 = Jap: : 
0 O 0 -l 


The same structure of the metric tensor is preserved under LT. This can 
also be seen by working out the components of g/,, directly from (7.93b). 
It now follows from Eq. (7.93b) that 


Juv = GapQhQ”s- (7.96) 


We shall now define a Lorentz transformation to be a linear transfor- 
mation — from one Cartesian system (x) to another (a’) — such that 
the transformation matrix 2 satisfies Eq. (7.96). The simplest example of 
such a matrix © is (7.69), and a more general one is (3.20). Also note 
that the above definition is a restatement of the property of the LT given 
in Eq. (3.44). 


7.9.4. New tensors from old ones, index gymnastics 


A special significance of the metric tensor is that it is instrumental in gen- 
erating a class of “converts” through two operations, namely, lowering and 
raising an index. 


Lowering an index 


In Eq. (7.64) we had written the primordial contravariant 4-vector dy 
as dx" = (dx°, dx', dx”, dx?) = (cdt, dx, dy, dz). One can now define four 


infinitesimals dx, by the operation: 
dip, = Guvdx”. (7.97) 
It follows from the g,,, matrix given in (7.91) that 


dz, = (dx°, —dx*, —dx”, —dz*) = (edt, —dx, —dy, —dz). (7.98) 
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In general, any contravariant vector V" can find its covariant counter- 
part V, through the operation: 


Vi = Iv V". (7.99) 
This V,,, as much as dz,,, is a covariant vector. We shall prove. 


Proof. Consequent to Lorentz transformation V,, should change to V,/, as 
per the rule: 


Va = GurV" = gap(Q7")%, (Q7*)F,07,V" 


Ht 
= gapV°(2*)%, 55 = gapV"(Q7*)%, 
2VA0 (7.100) 
This confirms the covariant character of V,,. (QED) 


The operation (7.99) is called lowering an index. It consists of multi- 
plication with g,,,, followed by a summation over the contravariant index 
that is to be lowered. We can geneleralize this lowering operation to cover 
any tensor that has at least one contravariant index. For example, consider 
the tensor T#,°. We shall show two operations, the first one will lower 
the first contravariant index ys, and the second one will lower the second 
contravariant index o. 


ip Sa Saal (7.101) 
Lo as Lae EP Gee (7.101b) 


The reader should prove that the new matrices (a) T;;7, (b) T4%, — 
obtained by the lowering operations are tensors. 


Raising an index 


The converse of the lowering operation is the raising of an index, for which 
one employs the contravariant metric tensor g"”, defined by the relation: 


I” Juv = OE. (7.102) 


The above equation means that in any coordinate system, (g“”) is the 
inverse of (g,,). In particular, if the coordinate system is Cartesian, then 
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(reader verify) 


1 O 0 0 
: =f o 0 
0 O QO -1 


As in the case of the lowering operation, we define the raising operation 
as multiplication with g’”, followed by a summation over the covariant 
index (the tensor being operated upon must have one or more) which is 
being raised. We shall show two examples: 


Fi 1 FY = gt Fa; (7.104a) 
Ce a ae (7.104b) 


The reader should prove that if the operated matrix (in this case 
F,,,T",,) is a tensor, the end matrix (in this case F",T%,”) is also a tensor 
with a reshuffle of one of its indices. 

The reader may have found out by now that in either the lowering or 
in the raising operation the teme component does not change, whereas the 
space components change their signs only. For example, 


if At = (A°, A), Ky = (Ko, K), 
(7.105) 
then A, =(A°,—A), K* = (Ko,—K). 


We have provided a few examples of raising and lowering in 
Appendix A.2 

There are three other ways of creating new tensors, that do not involve 
the metric tensor directly. We shall briefly explain them. 


Multiplication, division, contraction, scalar product 


The “tensor product” of two tensors A and B of ranks m and n, respectively, 
is a tensor C = AB of rank m+n. For example, if A”” and B;*g are two 
tensors, then 
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OM 2a Ae BY, (7.106) 


is a tensor of rank 5, having three contravariant and two covariant indices. 
This is the Product Theorem. 

The inverse of multiplication is quotient. If C = AB and C and B 
are tensors of ranks m and n, and m > n, the quotient A = C/Bisa 
tensor of rank (m—n). For example, if the relationship (7.106) is known to 
hold, and if C¥",°, and B;°, have been confirmed to possess the desired 
tensor properties, then A“” must be a tensor of rank 2. This is the Quotient 
Theorem. 

If F is a tensor of rank n > 2, and if it has atleast one contravariant 
and at least one covariant index, then by equating one contravariant index 
to one covariant index and summing over it, one gets a contracted tensor 


of rank n — 2. For example, if F'°",, is a tensor, then 


FO = Ft. (7.107) 


is a tensor of rank 2. This is the Contraction Theorem. The reader must 
prove the above three theorems. 

Using a combination of lowering (or raising), tensor product and con- 
traction operations, we shall obtain the scalar product of two contravariant 

F re : 

(or two covariant) vectors. Let A = (A“), B = (B") be two contravariant 
vectors. Using the lowering operation, product operation and contraction, 
we get 


TY = AMBY; TY = AMB, + TH, = AYB,,. (7.108) 


—> 
We shall call A“ B,, the scalar product of the two contravariant vectors A 
and B, and write this as 


A.B = A“B, = A°Bo— AB! — A2B? — A3B3 = AB A.B. 


(7.109) 


The object x . B is a tensor of rank zero, having been formed by 
contraction of a tensor of rank 2. Therefore, it is a 4-scalar. Its value will 
be the same in all inertial frames. Since two inertial frames are connected 
by a Lorentz transformation, a 4-scalar is also called Lorentz invariant. 

In particular, if we take the scalar product of a 4-vector, contravari- 
ant or covariant, with itself, we get the square of the magnitude of the 
4-vector, often called the norm of the vector. For example, the norm of the 
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; rae 
contravariant vector A is 


(A)? = AMA, = (A°)? — (A)? — (A?)? — (48)? = (4°)? — A. (7.110) 


The norm of a 4-vector is Lorentz invariant. 


The gradient and the d’Alembartian operator 


We had introduced the 4-gradient operator V,, in Eq. (7.72), as a vehicle 
which itself is not a tensor, but changes a 4-scalar field into a covariant 
4-vector field. Using “index gymnastics”, we shall obtain the contravariant 
form V“ and the contracted form V,,V" of the same gradient operator, as 
all of these three forms will have important applications in the remaining 
chapters of this book. 

It follows from (7.72) that 


o,-(2.8 8 0) (la 000 
# \.€2°’ Oa!’ Ox?’ O23] ~~ \.c Ot’ Ox’ Oy’ Az)’ 
SO 
02°’ Ox!’ Ox?’ Ax?) \cdt’? Ox’ dy’ Az)’ 
i: 8 Oe PN fle 
COR Oa2 Ody? dz2) \e ar 


VE = gl"Vy = ( 


-v*) ath) 


where V? = 
operator L7? is called d’Alembertian operator. It is then obvious that if ®(«) 
is a 4-scalar field, then (i) V,,®(z) is a covariant 4-vector field, (ii) V“®(z) 
is a contravariant 4-vector field, and (iii) 0?®(z) is a 4-scalar field. 

For future convenience, we shall rewrite Eqs. (7.11la) and (7.111b) as 


ace 
c Ot? Ox’ Oy’ Oz 


2 + ae + is the familiar Laplacian operator, and the 


VeCp=iVevieS ( 


10 
= (SH): 


V = (V4) = (V°, V3, V2, 73) = ( 


10 
7 (Ge-¥)- 


(7.112) 


Oe ania oe 
cot’? Ox’ Oy Oz 


Chapter 8 


Four Vectors of Relativistic 
Mechanics 


Chapter 4 outlined the salient concepts and formulas of Relativistic 
Mechanics. However, the style and format were non-relativistic, in the sense 
that the kinematic and dynamical quantities — like velocity, momentum 
and force — were written as 3-vectors, whereas true spirit of relativity would 
require their expressions as 4-vectors. This requirement will be justified in 
Chapter 11 in the context of the Principle of Covariance. Our immediate 
concern is now to find 4-vectors which will satisfactorily represent the above 
quantities. We shall sometimes (but not always) follow the convention set 
at the beginning of Sec. 7.8, in particular in Eq. (7.65), i.e. first write it 
as a geometrical object with no reference to its components, followed by a 
specific reference to it as a 4-component object. 

Let us be specific that the 4-vectors of relativistic mechanics that we 
are going to construct and deploy for our purpose are contravariant vectors, 
e.g. 4-displacement dY, 4-velocity V, 4-acceleration A, 4-momentum P, 
4-force F. Our immediate purpose is to build up the equation of motion in 
the Minkowski space-time M*, and then use them in the larger objective 
of constructing the energy-momentum stress tensor in Chapter 12. The 
above 4-vectors will play a key role in achieving this primary objective. 


8.1. 4-Displacement 


Consider once again the motion of a point particle in Fig. 8.1. In Fig. 8.1(a), 
we have depicted its physical trajectory C in E?, the Euclidean 3-space. 
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cI 


Physical Trajectory 


1 | ettedt 


a 
Ye 


Projection of C on XY Plane 


XxX (a) x Projection of C on XY Plane (b) 


Fig. 8.1. (a) Trajectory of a particle in E%; (b) world line in M4. 


In Fig. 8.1(b), we have shown its world line T in the four-dimensional 
Minkowski space M+, suppressing the Z-axis. 

P (a,y,z) and Q (a + dz, y + dy, z+ dz) are two infinitesimally close 
points on the trajectory C’, reached by the particle at times t and t + dt, 
respectively. The coordinates of these events are 


Op = (a) = (ct,r) = (ct, x,y,z), 


Og = (a" + dz") = (ct + cdt,r+ dr) = (ct+cdt,x+dz,y+dy,z+dz). 
The infinitesimal 4-displacement from Op to Og 
dv = (dx") = (dx®, dz, dx”, dx*) = (edt, dx, dx”, dx) (8.1) 


is the “primordial” contravariant 4-vector from which all other “truly” 
contravariant 4-vectors (i.e. those which have not been converted by the 
“raising” operation) by multiplication with scalars and differentiation 
(cf. Sec. 7.5). 

We shall define the basis vectors along the { cT, X,Y, Z} axes as é), = 
(ef, @2, 67,2), in terms of which we shall write the above displacement 
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4-vector as 


dv =e, dx" = (cdt, dr) 
(8.2) 
where dr = displacement 3-vector from P to Q. 
The time interval dt, which was treated as a scalar while obtaining the 
v, a, p and F vectors in Eq. (7.31), is no longer a scalar in the M* context, 
because the “time interval” transforms under an LT. The quantity which 
should now replace dt is dr, the “proper time” interval between the events 
Op and Og. It is then imperative that we should establish the 4-scalar 
nature of dr. 
The norm of the 4-displacement da“ is, according to Eqs. (7.110) and 
(7.90): 


ds? = c* dé? — dr’, (8.3) 


and is therefore a 4-scalar. In the instantaneous rest frame of the particle, 
dt = dr, and dr = 0. Therefore, 


ds? = c’ dr”, so that dr = ds/c. (8.4) 


Since ds is a 4-scalar, and c is a universal constant, dr is a 4-scalar. (QED) 


8.2. 4-Velocity 


We shall now define the 4-velocity ee (V") of the particle at the event 
(cz) as the following contravariant vector: 


—> 
Ve eg = al eee es 
dtr dt dt’ dt dt’ dr’ dr’ dr 


Also, as we explained in Eq. (4.37), 


dt 1 d d 
— -T- —__ that. —=P—. 8.6 
dr J1— 2/2" Oe or dt ee) 

Therefore, 
V Se Veer edt als 

eV ( al (8.7a) 
= (Te, Iv) (8.7b) 
= 1G te. Uy Ue): (8.7c) 


To avoid confusion in future, we shall denote the Cartesian components 
of the velocity 3-vectors in lower case, and with subscripts 1,2,3 to mean 
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x,y,z. We shall rewrite (8.7) as 
Vx = é) V¥ = (Tc, Tv) = T(ce, v1, v2, v3). (8.8) 


What is the magnitude of the 4-velocity V? As suggested through 
Eq. (7.110), the quantity that comes closest to “magnitude” is “norm” 
which in this case is 


VV a(V)2 HV, H=P(2 0) =PA(1—v?/2) = 2. (8.9) 
Therefore, the magnitude of any velocity is 


(V2 =c. (8.10) 


All 4-velocities have the same “magnitude”, equal to the speed of light c. 
Mount Everest and a 1 GeV proton move with the same absolute magnitude 
of speed. The reason is not difficult to find. You may be the fastest globe 
trotter. Yet when you view the world from your jet plane, you find yourself 
at rest, and the rest of the world, inducing Mount Everest, moving at 
jet speed. Since motion is relative, it is not possible to make an absolute 
judgement of which is slow and which is fast. Therefore, all velocities, “big” , 
“small” even “zero,” have the same absolute magnitude. Relativity is a 
great equalizer. 


8.3. 4-Acceleration 


The 4-acceleration A is a contravariant vector defined as av We shall 
write it more explicitly as follows: 


dr’ 
8.11) 
dV" at d?x dy d?z ( 
—oe’ Aua~@a a po Boe Ed, Gee 
ord pe eis (=) (S445). 
It is easy to see that the 4-vector A is “orthogonal” to the 4-vector V: 


A.V =0. (8.12) 


To see this take the proper-time-derivative of both sides of (8.9). Th 
side reduces to 0. The left-hand side becomes AV . V) = ov a 
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We shall go a little bit deeper and examine in what way the 
4-acceleration A is connected with the 3-acceleration 


es a = yy (8.13) 
as in N.R. mechanics. In the equations below, as above, “dot” (°) would 
mean 4. Referring to Eq. (8.7b) for differentiation, 

A= or rire, v)] =I1c, v) +10, a), ae 
or A =T(Cc, fv +Ta). 
We shall now use (8.12) to eliminate I. 
[(Uc, fv +Ta) -T(c, v) = 0, 
or F?(Pce? — I'v-v+Ta-v)] =9, 
or '(c? — v?) —Ta-v =0. (8.15) 


=. ub? 
Hence, using (4.15) [ = —(a-v). 
c 


Using the above result, we can now rewrite (8.14) in the more useful form: 


Az (Fta-v), Sla-vv + Ta). (8.16) 


8.4. 4-Momentum, or En-Mentum 


Multiplying 4-velocity Vv with the rest mass m, of the particle we get the 
4-momentum of the particle, which is an important member of the 
contravariant family. 


B= mV =m,[(e,v) =m,[(¢, ve, vy, vz): (8.17) 
According to Eqs. (4.45), (4.46), (4.79) 


m = relativistic mass = [m,. (8.18a) 
p = relativistic momentum = [m,v = mv. (8.18b) 
€ = total energy = I'm,c? = mc’. (8.18c) 


Note that we have used two different symbols for energy, namely FE in 
Chapter 4, and now € in the present chapter. The reason is that we shall 
prefer to reserve “E” for the magnitude of the electric field, as we are going 
to take up the covariant equations of electrodynamics in Chapter 11. 
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It follows from (8.17) and (8.18) that the time component of the 
4-momentum is energy (divided by c). Therefore, 


ns 
Pp = e;, p! = (<. p) = En-Mentum. (8.19) 


Energy is thus integrated into momentum into a single 4-vector, the 
4-momentum, in which energy is the zeroth component, i.e. the time com- 
ponent, followed by three space components. It will be easier to remem- 
ber the constituents of this 4-vector, and their ordering, if we give the 
4-momentum an alternative and generic name En-Mentum. 

There is a more important justification. The name 4-vector, or 
4-momentum, makes us expect four components in the 4-vector involved. 
Most often this is not true. There can be 2-components, or 3-components 
when the motion is one-dimensional, or two-dimensional. The name 
En-Mentum will steer clear of this confusion. 

This fusion of two distinct quantities, energy and momentum, into a 
single entity En-Mentum was expected, if not overdue, following our awak- 
ening to the fact that the energy and momentum conservation laws could 
not be separated into a conservation of one of them without the other, as we 
saw in Sec. 4.6. Either we get the conservation of the united En-Mentum or 
no conservation at all. Read our observation following Eq. (4.92). We can 
therefore interpret the energy-momentum transformation equations (4.89) 
and (4.90) as the transformation of En-Mentum, which we rewrite here as 
the transformation of its time-space components: 


10 0 1 é! é 
Pe See = a | ele a (8.20a) 
1 1 0 / € 
po =p — Bp’) > pp = 7 | Px — B=) (8.20b) 
p? =p*; p° =p" => p, = py; Dy = Pe: (8.20c) 


Suppose in the frame S we have pure energy €, but no momentum 
(we can think of a ball of radiation, propagating isotropically in all direc- 
tions). Thanks to € = mc’, this radiation will have a rest mass m, = €/c?. 
According to (8.20b), this “ball” will have a momentum p/, = —yc6m.,, 
same as that of a particle of rest mass m,, moving with the velocity —{c, 
in agreement with (8.18b). Pure energy in one frame transforms into a 
combination of energy and momentum in another. 
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It is seen from (8.18c) that the energy € of a particle is proportional to 
the Lorentz factor [, which can be used as a representative of the energy. 
From (8.20a), we find its transformation from S to $’ under the boost: 
S(8,0,0)S’: 

I’ = 41 (1 — Bvz/c). (8.21) 

The magnitude of 4-momentum (its “norm”) has a special significance. 
By Eqs. (8.9) and (8.17) 

(PB)? =m,2(V)2 = m2, (8.22) 
so that the magnitude of 4-momentum is m,c. However, from Eq. (8.19) 
€2 
(PY =~? - p? == -p®. (8.23) 


Cc 


Equating the right sides of the above two equations, we rediscover the 
energy-momentum relation given earlier in Eq. (4.83): 


E? = (m,c?)* + p?e?. (8.24) 
For a massless particle, e.g. photon, the above equation yields 
E =|ple, (8.25) 


as in (4.84). We shall therefore specialize the definition of 4-momentum 
given in (8.19) for a photon: 


Ce 
Pp = e;, pl! = (5. =n) for a photon, (8.26) 


where n is the direction of propagation of the photon. For a photon, 
(Py? =0 (8.27) 


so that the En-Mentum of a photon is a null vector. 


8.5. 4-Force, or Pow-Force and Minkowski’s Equation 
of Motion of a Point Particle 


The structure of 4-force F should be such that it will recast Newton’s 


equation of motion F = op in E° to Minkowski’s equation of motion (to 


be referred to as EoM frequently) in M* in the form 
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The 4-force #F that appears on the right is known as Minkowski force. 
However, we may like to call it Pow-Force, for reasons that will follow. Let 
us write 


F =a FY = (F°,F). (8.29) 


The EoM (8.28) combined with the definition of Pp given in (8.18), (8.19) 
means that 


1d€ 1_d€ II i 

= eller: a =i =F, (8.30a) 
dp dp 
alee ; 
7 7 F, (8.30b) 


where II (capital pi) = & It stands for the power received by the particle 
(same as energy received by the particle per unit time), due to (i) work 
done on it by external forces, and/or (ii) by absorption of radiation or heat 
(thereby changing its rest mass). In the case of a particle whose rest mass 
does not change, II is the same as the power delivered by the force F; cf. 
Eq. (4.81). 


According to Newton’s second law of motion, Eq. (4.47), 2 = F. Hence 


> dt 
the space component of F i is identified as TF, and the time component as 
Po = Fil/e. Then 


F=f (=F) = Pow-Force. (8.31) 


Since the time component of F is Power (multiplied by ['/c), it may be 
easier to remember the constituents of this 4-vector, and their ordering, by 
the caer and generic name Pow-Force. 

Since 4 =I4, Minkowski’s EoM (8.28) gets resolved into the following 
time and ne components: 


Time component: 


d 
Space component: — = 


The space component of Minkowski’s equation of motion is a restate- 
ment of Newton’s second law of motion. The corresponding time component 
expresses conservation of energy. It simply states that the rate of change of 
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the kinetic energy of a particle equals the power delivered by the external 
agents (e.g. forces) acting on it. 

Let us consider an important application of the EoM (8.28) to a point 
particle with a constant rest mass mo. Due to (8.17), then the above EoM 
takes the alternative form 


mA =F. (8.33) 


The above equation shows that this 4-force is orthogonal to 4-velocity, a 
fact that follows from (8.12): 


FV =0. (8.34) 


Therefore, the four components of F are not independent. We shall fix the 
time component using (8.7), (8.31) and (8.34). 


(r=) (Cc) — ((F)- (fv) =0 > =F -v. (8.35) 


Hence, for a particle with constant rest mass the 4-force must have the 
form: 


Far (=F), (8.36) 


Now the equation of motion (8.28) in M* will have the time and space 
components: 


d 
Time component: ae F -v. (8.37a) 


d 
Space component: .  (8.37b) 


8.6. Force on a Particle with a Variable Rest Mass 


Let us now be more general and consider a 4-force that can change the rest 
mass of the particle. We shall then write 


Frot =F+ K, (8.38) 


where F is the force that does not alter the mass of the particle, and is 
orthogonal to the 4-velocity. It has the same properties as in (8.33) — (8.37). 
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ra can be called a convective force, as it can alter the mass of the particle. 
The time-space components of KC can be written as 


R=1 (2.x) ins; K= (“2.x inS,  (8.39a) 


Cc 
Kigs¥ =e-V er G-K) =i, (8.39b) 
d 
where II, = — 2 (8.39c) 
iT 


is the convective power absorbed in S, and m, is the proper mass of the 
particle. We shall illustrate this case with the example of a Relativistic 
Rocket in Sec. 9. 


8.7. Lorentz Transformation of the 4-Vectors of 
Relativistic Mechanics 


We shall apply Lorentz transformation to the 4-vectors of Relativistic 
Mechanics and from the resulting relations extract the transformation for- 
mulas of the corresponding 3-vectors, and make sure that they agree with 
the transformations obtained in Sec. 4.1 and 4.2. 


8.7.1. LT of 4-velocity 


We shall apply the general transformation given in Eq. (7.78), to the veloc- 
ity 4-vector V shown in (8.7b). We shall rewrite the same as 


V =Tc(1,v), (8.40) 


where v = v/c as in (4.22), so that the quantities inside the brackets are 
dimensionless: 


(7.78a) > IY =oT(1-f-v), (8.41a) 
(7.78c) => I'v’ =T[(yv, + v1) — 7G], (8.41b) 


)_ Tom + v1) - 8) 


or, T’ ’ 


(8.41c) 


using (8.41a), vt, =v = (ye + i) = 98 (8.41d) 


y(1— B-v) 
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. vi —B 
from which V = G—B-v)’ (8.41e) 
Vi 
ie, oS, 8.41f 
“l= Ben) ve 


We get back (4.23a) for the special case of x-orientation. 
Equation (8.41a) will be found useful in many applications. Hence, we 
shall write it, and its inverse, as two separate equations: 


I’ = 1(1-B-v), (8.42a) 
P= 7(1+8-v’). (8.42b) 


8.7.2. LT of 4-acceleration 


The components of A are given in (8.16). To make the formulas less com- 
plicated we shall specialize the expression for A to rectilinear motion, along 


the X-axis, and get the following expression for A, having only ¢ and x 
components: 
T4 [4 2 T? 2 
Aa Aa peared iba rta. (8.43) 
c C? C? 


In short, 
A =Ta (<. 1) (8.44) 
Cc 


where a is the acceleration along the X-axis. We shall now let K > A, 
and apply the transformation (7.77a). 


10 _ 0 1 m0 na (U 
A“ = 7(A’ - BA) ST’ — =7T%a(-- 86), (8.45a) 
c c 


Al = 7(Al — BA°) > P4a! = 9740 (1 = 6) (8.45b) 
Cc 


Let us now take the S’ frame to be the IRF, so that 


v =0, IY =1, v=fe, y=TP. 


Now go back to (8.45). Equation (8.45a) yields 0 = 0. Equation (8.45b) 
gives the acceleration in the IRF, also called proper acceleration, as 


a’ =T*a(1 — 6") =T*a, (8.46) 


same result as in (4.29). 
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8.7.3. LT of 4-momentum 


We shall take the same steps as for 4-velocity. Recall the time and space 
components of 4-momentum as given in (8.19): 


PB = (=. p) , E=Im.c, p=Imy. (8.47) 
6; 


Apply the general transformation given in Eq. (7.78) to these components. 
First the time component. From (7.78a) 


or I’m,c = y(Tm.c — 8-T'm.v), (8.48) 


or IY =7T(1-£6-v). 
For one-dimensional motion, 


1’ =P (1- By), 


(8.49) 
T=!I"(1+ 8v’). 
We get back Eq. (8.42). Now the space component. From (7.78e) 
j E 
p=pt+n/(y—-1)-p)—78—|. (8.50) 


We shall get back Eq. (4.23) if we use the definitions of € and p as given 
in (8.47). 

The inverse transformation, corresponding to the boost: {S’(G)S} is 
obtained by replacing n with —n: 


pap tn|(—1)(-p) +995], (8.51) 


8.7.4. LT of 4-force 


The components of the 4-force are given in (8.31) which we rewrite here 
I 
Far(AP), (8.52) 


Here F is the 3-force as defined by Newton’s second law of motion ie =F. 


Apply the general transformation given in Eq. (7.78) to these components. 
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First the time component. From (7.78a) 


IT’ Til 
(é 


Cc 
r 
or I’ = 47 (-v-F), (8.53) 
ll-v-F 
using (8.42), II’ = Ta 


Now the space component. From (7.78e) 


Tu 

IF’ =TF+n | (y-1)n-TF)-78— |}, 
c 
(8.54) 
_F+n[(-D@-F)-784] 
yl — a+r), 

For a particle moving with constant rest mass, I] = F-v =F. cv. In this 
case 


using (8.42), F’ 


_ Fal -D@-F) +e F- | 
7 y= EP) 

The inverse transformation, corresponding to the boost: {.5’(3)S}, is 
obtained by replacing n with —n: 
_F4¢n[(7-D(n-P) +1BF 
7 —— ase 


Corollaries of the force transformation formulas: 


F’ (8.55) 


F (8.56) 


(1) A longitudinal force F\ (i.e. a force in the direction of the boost velocity 
CQ) is invariant under LT. 
Proof: Let S’ be the rest frame of the particle. Hence, v’ = 0. By 
assumption, n(n- F’) = F’. Hence, from (8.55) 


F) = Fi. (8.57) 
(QED) 


(2) A transverse force F’, in the rest frame S’ of the particle transforms 
to the F, /y in the Lab frame S. 
Proof: By assumption n- F’ = 0. By a similar argument 
= Bi 


Y 


Fi (8.58) 


(QED) 
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Y 
5A 


Fig. 8.2. Longitudinal and transverse force and area cross-section in the rest frame. 


(3) Pressure p in a perfect fluid is Lorentz invariant. 
Proof: By perfect fluid we mean a fluid in which there is no shear 
stress. If we consider an infinitesimal rectangular box inside the fluid 
(Fig. 8.2), the stress on all the bounding surfaces will be the same 
normal compressive stress p, called pressure. 

Let the pressure inside the fluid at a certain event O, in the corre- 
sponding instantaneous rest frame 5S’, be p’. Referring to Fig. 8.2, the 
force on the right face is 6F’ and the area is 6A!,. Similarly, the force 
on the top face is 6F%, and the area is 6A’,. Hence, p’ = — = —. 

Apply Rules 3, 4 of Sec. 2.7 to the above surface areas. The lon- 
gitudinal dimension contracts, but the transverse dimensions do not. 
Hence, 6A’, + dAy = 6A),/7; 6A, + 6Ax = JA. If we now apply 
(8.57) and (8.58), we get 


bF, 6F, 


Po= 5A, oAL 


a bAy OAL fy , 
Hence, p(x) = pz(x) = py(z) = p'(2). (8.59) 


(QED) 
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8.8. Conservation of 4-Momentum of a System of Particles 
8.8.1. Zero momentum frame, equivalence of E and mass 


Consider a system $ of N non-interacting particles, their individual rest 
masses, velocities and En-Menta being {mj,v', Bp’ = T’mi(c,v'); i = 
1,2,...,N}, with respect to some inertial frame S. The total energy, total 
Momentum and total En-Mentum of this system are as follows: 


€=S Timic?, P=S Timiv', B= (E/c,P). (8.60) 
i= 1 al 


In performing the summation in the above equation, we count each of 
the N particles at the same instant of time, i.e. simultaneously. On the 
space-time diagram the counting events lie on one X Y Z-hyperplane. 

Let us consider the boost: {S’ — (nf) — S}, and apply the Lorentz 
transformation formula (7.79a) and (7.79e) to the En-Mentum P: 


Efje=7lE fe+ B+ P'|, (8.61a) 
P=P’+n{(y-1)(m: P’) + y8(E'/o)]. (8.61b) 


The above transformation equations are valid, a priori, for the En-Menta 
(€'/c, p*) of individual particles referred to in (8.60). However, due to the 
linear character of the transformation equations, they are also valid for 
the total En-Mentum. The 4-vectors (€/c, p), (E’/c, p’), in Eq. (8.61) now 
stand for the total En-Mentum 4-vector in S and $”’, respectively. 

For obtaining €’,P’ in the above equation, we perform the summation 
by the same procedure as for Eq. (8.60). However, in this case the counting 
events are simultaneous in S”, but not so in S. This should not cause any 
problem, because the particles being non-interactive, and under no external 
forces, keep their individual En-Menta constant as they progress along their 
respective world lines. 

What happens when two of them collide. The collision of two particles is 
one event, a single point on the space-time diagram — same point whether 
in S or in S$”, although their coordinates will differ. 

In Sec. 4.9 we have presented an example of elastic collision between a 
photon and an electron, known as Compton scattering. The En-Mentum is 
conserved, and the CM of the system keeps moving on as a single particle 
along its world line. 
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Suppose the collision is perfectly inelastic, i.e. they coalesce and form 
a new particle C. Even then the En-Mentum of C will be the sum of the 
En-Menta of A+B, just after collision. That is, E& = E4 + Es; po = 
p4a+pe. However, in this case part of the energy Ec goes into the inner 
energy of the coalesced particle C, say in the form of “excitation energy”. 
Problem 4.6 gives a good example of this process. In this case also the CM 
keeps moving on as a single particle, just after the reaction. 

Let us go back to (8.61). Of all frames 5” there will be one in which 
P’ = 0. We shall identify this frame as Syest, and call it the rest frame of the 
system §, or better still the zero momentum frame (ZMF or ZM frame) of 
the system. This is the analogue of centre of mass frame of non-relativistic 
classical mechanics. 

We go back to Eq. (8.61), set P’ = 0, denote the energy in Syes, as €° 
and obtain: 


E= ne", (8.62a) 
P = nyB(E°/c) = yv(E°/c?) = Mo =T'vM), (8.62b) 


where, Mo = eof = rest mass of the system © in the frame S. 
(8.62c) 


This brings us to the following conclusion: 


Conclusion: The system of particles can be considered to be a single 
particle at rest in Syes, and moving with the boost velocity v with respect 
to S. It has a rest energy €, rest mass Mo, and the two are related by the 
relation Ey = Moc?. Its momentum P as shown in (8.62b), is in accordance 
with the definition of the momentum of a single particle as given in (4.44). 

We have replaced the Lorentz factor y associated with the LT, with the 
dynamic Lorentz factor T associated with the motion of a moving particle 
(see Eq. (4.38)). In this case, the particle velocity is identical with the boost 
velocity. 

Thus the inner kinetic energy To of the system contributes to the inertial 
mass of the system by an amount equal to To/c?. 

We shall illustrate the concepts with a few examples. 


8.9. Illustrative Numerical Examples III 


We shall now specialize our discussion to a system in which there are two 
particles before a collision, and two or more particles after the collision. 
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The collision may result in the production of a new system of particles 
which may or may not bear any resemblance with the original ones. One 


such example is the production of a pi-meson like 7°,7* from a nucleon— 
nucleon collision (p+p, p+n), another can be the example of the creation of 
an electron—positron pair when a y-ray hits an electron. In such cases, the 
En-Mentum of the system may not have the simplest expression as given 
in (8.17) in which the rest mass m, and the dynamic Lorentz factor T have 
been delineated clearly. 

Even when the En-Mentum looks different, we can still filter out the rest 
mass and the dynamic Lorentz factor by recasting the expression suitably, 
as in the following example: 


B = (A,B,C,D) =m, ( Z )( aE at), (8.63) 


mc} \° A’ A’ A 
This is in the form of Eq. (8.17), if we identify 
A Be 
m, ¢ —_ Is 7 = Ug; etc. (8.64) 


8.9.1. Example 1: Relativistic billiard balls 


The example we shall now discuss, and work out the details, will clarify the 
notion of the ZM frame, in particular how to find this frame, and illustrate 
how this intermediary will help us find a solution to the problem. 

Figure 8.3(a) shows two identical billiard balls A and B of equal rest 
mass m,. B is at rest, and A is approaching B along the X-axis with 
relativistic velocity CVA_in = CVA-in @x. The dynamic Lorentz factor corre- 
sponding this velocity is ain = 1/./1—v3_,,. After the collision, A and 
B bounce out with velocities cv aout and CYp-out, respectively. Our task 
is to calculate these velocities and the angles @ and ¢ they make with the 
X-axis. 

Figure 8.3(b) shows the billiard ball experiment from the ZM frame. 
Due to symmetry both balls approach each other and then fly apart with 
the same speed cv’, making angle 6’ with the X-axis. The dynamic Lorentz 
factor for both particles is IY = 1/V1—v?. 

We shall proceed towards the answers in several steps. 


Step 1: Find the velocity of the ZM frame with respect to the Lab frame. 


For a while we shall suppress the subscript “a-in” iN Va-in, Da-in, so that 
the equations look less cumbersome. T and v in the equations below will 


mean T,y-jn and v_jin. 
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v’ Y ee 
<—_—\ , Y’ 
S add v 
Lab Frame ——— Ka 
to vectors in S’ ZM frame 
to get vectors inS 
Va-out 
A 
A Vain B @ x 


(a) 


Fig. 8.3. Elastic collision between two equal balls seen from S and S’. 


Let c6 be the boost velocity, y = 1/./1— 8? the boost Lorentz fac- 
tor. We shall write the individual and total En-Menta before the collision. 
Since the encounter takes place on the X Y-plane, we shall write only three 
components of En-Mentum in the format P = éfp" = (p°,p', p?) (as in 
Eq. (8.17), but without p?). In the Lab frame, represented by S, the rele- 
vant 4-vectors are as follows: 


Prin = Gi, Dh = Tm.e (4 VY, 0), (8.65a) 
Pas = e/, Pe in = MC (1, 0, 0), (8.65b) 
Bn, = B vin + Pin =m,c(T + 1,Tv,0). (8.65c) 


LT will convert Pin Pi with components (P,°, P,!, P.2) of which 


in? in? 


the first two are non-zero: 
Pe = (0 +1) - flv) me, (8.66a) 


Pi=y(lv—B(l+41)) me. (8.66b) 


To identify the ZM frame we must set pS =0. This leads to 


B= (8.67) 
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Going back to (8.66a), and using (8.67), 


if Tv)? 
Po = T+1 _ | 
is v{+1) rob mec 


yM,C 
T+1 
= 2ym,¢. (8.68) 


{l? +2r+1-T7v*} 


We have used the fourth identity from Eq. (2.22). The total En-Mentum 
of the 2-particle system in the ZM frame can now be written as 


B' = 2m,c(1,0,0). (8.69) 


Particles A and B, together representing total energy (T+1)m,c? in the 
Lab frame S' (see Eq. (8.65c)), transforms into a stationary single particle 
with rest mass energy 2ym,c? in the ZM frame S$’. 

We can find a relation between [ and y by transforming P' > rad 
from the ZM frame S’ to the Lab frame S (using Eq. (7.77a) and setting 
6 + -8). 


PS, = (Pip + BP) = (27 + 0)moc = 27?m,e. 
But, P? =([+1)m,c, from Eq. (8.65c). (8.70) 


Hence, [ + 1 = 24?. 
We can transform this into another useful relation, using the identity 
2 — 2 A2: 
y-laVkR: 
P= 27? -1=7°(14+ 8"). (8.71) 
Step 2: Find the En-Menta of the balls before and after the collision. 


In the ZM frame, represented by S’, the relevant En-Menta are as 
follows: 


Piss = Cy Prin = I’m,c (1, v',0); 
Poin = Cp Pb in _ I’m,c (1, =", 0); 

(8.72) 
(iii) Bot =e pt oe =I'm.c(1,v' cos 6’, v’ sin 6’); 


=>) _Ww j i is j 
eC), PB-out = L'’m.c(1, —v' cos’, —v’ sin 6’). 
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We shall now find the components of the same 4-vectors in the Lab 
frame S' which is the rest frame of B. Hence, the boost velocity — is the 
same as the velocity of B in S’, and the boost Lorentz factor + is the same 
as the dynamic Lorentz factor I’: 


1 1 
Bau’; y=T' = —, = —— (8.73) 


T=? i= 


First, the in-coming components, by LT formulas (7.77a), with boost 
velocity —{, applied to the components shown in lines (i), (ii) of Eq. (8.72). 


Prin = (PR in TT BD's in) = ql nse (1 + B x v') 
= (1+ 6?) mc, 


Pain = Y(PAin + BP'Rin) = W'moc(’ + B x 1) (8.72a) 
= (2778) me, 


Pain — Din = 0; 
Pin = 1(PB-in + 8 PB-in) = WI’'m.c(1 + 8 (-u’)) 


= 77(1 — B?)m.c=™m.¢, 
(8.72b) 


PB-in — 1(PB-in Tv B PB-in) = ql msc(—0! + B x 1) = 0, 
Pein = PB-in = 0. 
Next, the out-going components, by LT of the components shown in 
lines (iii), (iv) of Eq. (8.72): 
ee = VR cay + BOK cat) = 7I’m,c (1 + Bv ’ cos 0") 


= 7° (1+ B? cos 6’) mc, 
Diccan = ON cst + BPP out) = yI’mec (v ' cos g’ + B x 1) 
(8.72 c) 
= 7°B(1 + cos 6") m,¢, 
eer = De out — Ip y sin 6! MC, 


= 78 sin 6 m,c. 
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Dewi — (PB out ar 8 PB-out) = qLlime (1 te B(-v') cos 6’) 
= 77(1 — 6? cos 6’) m,c 
PB-out = Y(PB-out + BP B-out) _ yI’m,c (= cos 6 + B x 1) 


(8.72.4) 
= 77 B(1 — cos’) mc 


2 —, al 2 
PB-out = PB-out 


= —78 sin @’m,c. 


Compactly, 


2 
Pain = mc (y2(1 + 62), 2728, 0) = m.ey2(1 + 6) (. —— 0) 


Beni = m.c(1, 0, 0), 


(8.74) 
P isk = m,c (77(1+4+ 6? cos 6’), y7B(1 + cos 6’), 7B sin 6’) 
= 2 2 / yG(1 + cos 0’) Bsind’ 
=m,cy (1+ B* cos 6’) (1. (1 + BE + B2 e086’) ¥(1 + B? cos 6’) cos 8") J’ 


ae = m,c (y7(1 — 6? cos 6’), y78(1 — cos6’), —yf sin 6’) 


A yB(L— cos@!) Bsin 6’ ) 


(1 — 82 cos6’)’-y(1 + 8? cos 8’) 
(8.75) 


= m,cy*(1 — 6? cos 6’) (1. 


Going back to Eqs. (8.74) and (8.75), 


Bin _B -in Pp -in =2 2 s 1 0 
A. + B ym c( ’ B, ) (8.76) 
1, 8,0 


Bout = Bice + Bcc = 277m,c( ry ; ) 


2 
2 


confirming that Pin, = Pont in the frame S. 
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Step 3: Find the velocities of the balls before and after the collision. 


Going back to Eqs. (8.74) and (8.75), 

sags 

=T3 

yB(1 + cos 6’) | 


Tain = 7°(1 + 6’); VA-in 


Ta-out = (1 at BP cos 6’); VA-out-x = 


(1 + 82 cos 0’)’ 
_ Bsin 6 
ae (1 + 8? cos 6’)’ 
1 — cos 6’) 
T -out = 1- 6’ } -out-x = nee — cont) } 
B t 1" ( B cos ie VB. t v1 = Be cos 
—Bsin 0’ 
VB-out-y = —————- 
Bouty ~~ 4(1 — B2 cos 6’) 
We can now find the angles @ and ¢: 
ate in 0 1 6! 
tan @ = cava = — = —tan—, 
Va-outx Yil+cos6’) ¥ 2 
VB-out-y sin 0’ 1 g’ 
4; SS SS — G— 
me VB-out-x y(1 — COs 6’) Y - 2” 


1 
tané tang = —. 
x 


(8.77a) 


(8.77b) 


(8.77c) 


(8.774) 


(8.77e) 


(8.78a) 


(8.78b) 


(8.78c) 


Note that the above relationship between 6 and 0’ on the one hand, and 
@ and 6’ on the other is consistent with the angle transformation formula 
given in Eq. (4.133). The exercise to confirm this is left to the reader. 

We can now use Eq. (8.77a) to express y? in terms of I',.;, using the 


identity y? — 1 = 6?4?: 


Pain = 272-1 
2 Tain +1 
9 2 
1 
where ['y-in = 


SS: 
V i Yin 


(8.79) 


We have now a more convenient equation for the angles in terms of the 


velocity of the impinging particle A. 


2 


tan @ tan oO) = Taz. 41 7 
A-in 


(8.80) 
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The reader should work out the exercise R1 in Sec. 8.1 to get a full 
appreciation of the equations presented in this section. 


8.9.2. Example 2: Threshold energy for a p+ p collision 
resulting in the production of a x° particle 


We shall consider the following example?®: 
pi t+ pe > py + py +7°. 


The left side represents colliding protons # 1 and # 2, before collision, 
and the right side the same protons after collision. The proton p2 represents 
a stationary target (hydrogen atom), and the proton p; the bombarding 
particle. There is a certain minimum kinetic energy Ko, called the threshold 
energy, above which the above reaction can take place. We shall determine 
this energy, using the ZM frame as a stepping stone. 

We have shown the reaction in Fig. 8.4. In the Lab frame, the target 
proton p2 is stationary before the collision. In the ZM frame, the target 
proton pz and the bombarding proton p; are moving with opposite velocities 
{8c and —{c before the collision and settle down to rest along with the new 
born 7° after the collision. 

We shall adopt the following rest mass values: mz = 135 MeV, mp = 
938.26 MeV. Now apply energy conservation in the ZM frame: 


2Pmp = 2Mp + Mz, 
Ma 135.0 


ce Were site Sia 
Now, B? =1— Tr? = 0.13. 


Hence, 6 = 0.36. 


oo re pat 
| 2 Lab frame = eM frame eee | 
a a see | (BB 1 pen: 

SS a | 8 * ae 

I i agai 1 Hl 2 
2 Be ce 3 ae 
before i posseape pases after 
iipen before if 
xX x’ 


Fig. 8.4. Nuclear reaction p +p > p+p-+7° seen from two frames. 


4See [25, p. 186]. For the actual reaction, see [37, p. 689]. 
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To get the velocity 89 of the bombarding particle p; in the Lab frame, we 
shall apply the velocity transformation formula (4.7), to go from the S’ 
(ZM) frame to the S (Lab) frame. 


B+B 2 x 0.36 
Bo 1+8xB 1+0.36 x 0.36 pest 2) 


The corresponding dynamic Lorentz factor is 
1 1 


lo = —— _ = ———_- = 11.3. (8.83) 
Vl=8 W1—0.637 
The kinetic energy of p; should be 
Ko = (To — 1)mpc? = 0.3 x 936.26 = 281.478 MeV. (8.84) 


This much is the minimum kinetic energy to be given to the bombarding 
proton to produce a mass which is less than half this value. The remaining 
half is shared by the three particles emerging from the reaction. 


8.9.3. Example 3: Threshold energy for a photon hitting 
an electron to produce an electron—positron pair 


We shall consider an example of (e, e_) pair production by photons?: 
Ware. se +et+e. 
This example is similar to Example 2, except that a gamma ray, which is 
a particle of zero rest mass, now plays the role of the bombarding particle. 
We have made a picture of this reaction in Fig. 8.5. 


Let (€o,€) represent the photon energy in the ZM frame and the Lab 
frame, respectively, and similarly ((o,I) the dynamic Lorentz factor of the 


Pat Y’ 
rere i ee | 
= Lab frame S"_|ZM frame 
poe teeta ee a lL fat cine aes Sak 
\ Hail e ; ' a +1 
~~ bg eT ig e~ e- 
Blinnnn— > © ie Or, “nn <@ ‘ 
ti ew 
Eb eth Seat tee. a |e See 2 ore serene 
before | before after 
| _Gfter ™ | 
xX xX 


Fig. 8.5. Nuclear reaction y+ e7 + e~ + e+ +e7 seen from two frames. 


>See Ref. [26] and Problem 7-2 on p. 225 in Ref. [25]. 
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electron in the same two frames, and let m be the rest mass of the electron. 
Let —po be the electron momentum in ZM. Now apply the conservation 
equations in the ZM frame: 


Momentum : Eo/c = po = To Bme. 


Energy : €5 + Pomc? = 3mc?. 


Hence, 3-—To =To8, 
or To(1+8)= nt ee) 
via? 
from which B = To i - 7 >. 
Eo = poc = To 8mc? = ; x sme? = ame 


Now consider the boost:S’(—8)S from the ZM frame to the Lab frame. 
Apply the 4-momentum transformation (8.48) to the energy of the photon. 
In this case, the boost velocity (6 is the same as the electron velocity @ in 
the ZM frame, and the boost Lorentz factor Ypoost is the same as 9. Note 
that we have added a subscript to distinguish the boost Lorentz factor from 
the gamma ray: 


E E E 
— = boost - + Bre] =To - + a) ; 
6; (és Cc 


45 4 4 4 2_ {5 4 _9 2 2 (8.86) 
or €= 3x (F4 Ex 5) me = (Fx 5% 2) me =4me 


=4x 0.511 = 2.044 MeV. 


As in the previous example, we need a photon of minimum energy 4mc? 
to produce an electron—positron pair of rest mass 2mc?. The balance 2mc? 
goes to the kinetic energies of the pair (e~, e+) and the bombarding particle 
e~ emerging after the impact. 


8.9.4. Example 4: Compton scattering and inverse 
compton scattering 


1. (a) Consider a collision between two particles A and B, moving with 
velocities u, and uy, their respective rest masses being moq and mop. 
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Their 4-velocities are as and Ue. Show that 
Uz -U, = eT (8), (8.87) 


where I'(3) is the Lorentz factor corresponding to relative velocity cG 
between A and B. 

Hint: Since the quantity is Lorentz invariant, evaluate the product in the 
frame of reference in which one of the particles, say B, is at rest. Now write 
the 4-velocities in this frame. - 

(b) Let their 4-momenta be P, and P,. Show that 


P, - Py = 2T(B)moamos. (8.88) 


(c) Let the particle A represent a photon propagating with frequency 1% in 
the rest frame of B. Show that 
> > 
P, - Py = hip~Mop. (8.89) 
(d) Suppose both the particles are photons. In the Lab frame, their fre- 
quencies are v, and 1%, and the angle between their paths of propagation 
is 8. Show that 


h 2 
Pir (=) VaVp(1 — c08 8). (8.90) 
Cc 


2. Consider the elastic collision A+ B-— A’+B’. Let A, A’ represent a 
photon and B, B’ a subatomic charged particle (e.g. an electron, a proton) 
before and after the scattering. The photon is propagating in the directions 
n and n’ before and after collision. We write their 4-momenta, and the 
“magnitudes” of the corresponding 4-momenta, as defined in Sec. 7.9.4, 
and written in Eqs. (7.110), (8.19), (8.22), (8.26), (8.27): 


B, = (hv/c)(1,n), PB’, = (hv /c)(1,n'), (P,)? = (P*)? =0, 


. (8.91) 
P.=mJ(cu), PlamI(ew), (Pe)? = (Bi)? = me. 
From conservation of 4-momenta: 
PB, +P. =P +P, 
(8.92) 


or Py + Be — Pi, = Pt 
Square either side and use Eq. (8.91) 
aly ima ee sees A (8.93a) 
h?vv! m,Ch 


(l-n-n')= q [((v —v)c—u-(vn—n'v')].  (8.93b) 
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Apply formula (8.93) to Compton scattering. Take u = 0, the initial 
velocity of the electron. Now obtain the formula (4.124), which had derived 
in Chapter 4. 


Ans. Here [ = 1. Also n- n’ = cos0, where @ is the angle between the 
initial and the final directions of the photon. Inserting these values, we now 
get 


2 yy yf 
: = (1 — cos @) = m,h(v — v’), 
5 (8.94) 
mh(y—v')er _ (1 0) 
or a 7 = cost). 
Now 
va=7):. ©. ©.) 
a a rN — x. (8.95) 
Hence, 
' h 
N-rA= (1 — cos). (8.96) 


) 


We get back the same Compton scattering formula (4.124), which had 
derived in Chapter 4 using a long route. 


3. Apply formula (8.93) to inverse Compton scattering. An electron mov- 
ing along the negative z-axis, with a high velocity u = —ue,, collides with 
a photon of energy hy proceeding along the positive X-axis. After the colli- 
sion the photon bounces back to the negative X-axis with energy hv’. Find 
an expression for the energy of the outgoing photon 

Ans. Write the three 4-vectors involved in this problem, keeping the 
(time, x) components only. 


B, = (hv/0)(1,1); Pi, = (hv’/e)(1,-1); Be =m,Fe(1,—u/c). (8.97) 


We shall rewrite the formula (8.93) by setting n-n’ = —l,u-n = 
—u,u-n! =u. 
hévv' Th 
> (1—n-n’)= ~ [(v —v')e—u-(n—n'v’)], 
h?vv' Th 
or 2 = = ~ [(v —v’)o+(v +0')u] (8.98) 
moUh 


= [let uy — (e— uv] 


= (mTh) [1 + u/e)v — (1 —u/c)v’). 


262 Special Relativity, Tensors, and Energy Tensor 


Now, in inverse Compton scattering the velocity of the incoming particle 
is very high, almost equal to the speed of light. We can therefore make the 
following approximations: 


(l+u/c) = 2. 


1 uw 
Also, ro 1— a (1—u/c)(1+ u/c) & 2(1 — u/c) (8.99) 
1 
l-u/cex 5P2" 


We now go back to (8.98) and write: 


h?vv! 1 
2 ree 2(m.Th) |» - =| 
C v= =DPme 1 vy 
ay i — 8.100 
. 2(hv)? v hv | 4T? “| ( ) 
i mec \ i I'm,c? 
- 4D hv ) v hv 
Let us write the left-hand side as 
m,c? (4D hv 
lLhs = — 1.) 8.101 
© OE hv (— ” ) ( ) 
Then 
/ 4? 
a (8.102) 


0 (Be) 
We can also write this in terms of the initial and final photon energies. 

Ear? 

(EE) 
4. Consider the case in which the incoming electron energy is 107 MeV 
(about 2000 times its rest mass). The initial photon energy is almost the 
same as (or more than) the energy of the electron. Show that the incoming 


electron transfers almost all of its energy to the y-ray. (Taylor and Wheeler 
(6, Example 8-23, p. 270]) 


(8.103) 


Ans. In this case T ~ 2000, so that “+1” in the denominator can be 
ignored. Using this approximation in (8.103) we get the energy of the back 
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scattered X-ray: 
€' =Tm.,c’, (8.104) 
which is the energy of the incoming electron. 


5. Take a photon of Cosmic Microwave Background, having photon energy 
hv equal to 1073 eV. A proton in the Cosmic ray of kinetic energy 104 
MeV collides with it. Proton rest mass energy is 939 MeV. For simplicity 
take it as 10? MeV. Find the energy of the photon after the collision (See 
[4, p.123]). 


Ans. We shall use Eq. (8.103), but replace m, with m,, the mass of a 
proton. In this case 
([ — 1)mpc? = 104, 
or, r—-1#10". (8.105) 
Take Tr =101'. 


The denominator in (8.103) is 


AVE 4x10" x 107-3 
denom = = Ste ee (8.106) 
MpC? 109 
Hence, 
4 1 22 
= a - x 1073 = 2.9 x 109 eV = 2.9 x 1013 MeV. (8.107) 


The extremely “weak” photon gains an enormous energy of ~ 1013 MeV 
from the impact with the cosmic ray proton. 


8.9.5. Example 5: Doppler effect 


Example 1. A fast moving space station is moving with speed (c along 
a straight line and is emitting light at a frequency fo. An observer O is 
located at a distance h from the straight line shown in Fig. 8.6. We are 
required to calculate the frequency f of the same light as detected by the 
observer as function of time. 


The first thing to remember is that when an observer O receives light 
from a moving source S at time t, this light comes not from the present 
location C of the source, but from the retarded location B of the source, 
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# 


-h/c 9 t retarded time 
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Fig. 8.6. Source moving at impact parameter b. 


at the coordinate x, and emitted at the retarded time t’, as explained in 
Fig. 8.6. These two times are related by the equation: 


c(t-U)=R, (8.108) 


where FR is the distance between the source S' and the observer O at the 
present time t. When we use the Doppler formula (3.31) 


ro fo 
~ (1+ Bcosé)’ 


the angle @ is the angle at which the source was located at the retarded 
time t’. That is 


(8.109) 


av 
g=—. 8.110 
cos R ( ) 


Our first task is to find a relationship between x and t. It is assumed 
that the source is moving with speed c{ along the X-axis. We set the origin 
of the X-axis at A, vertically above O, and set time t = 0 when light emitted 
from A is received at O. We shall find the solution in several steps. 


Step 1: Find t/,, i.e. the time when the source was at A. 


In formula (8.108) sett =0, f =t,, R=h. 


ctl, =—h. (8.111) 
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Step 2: Find a relation between the time ct’ and the displacement <x. 
Referring to Fig. 8.6, S was at A, when ct’ = ct’, = —h, at P when time 
ct’ = 0, at B when time ct’ = ct’. S is moving with speed c3. Referring to 
Fig. 8.6 


z= AB = AP + PB = Bh+ Bet’. 
x (8.112) 


a 


Step 3: Find a relation between ¢ and x. 
From (8.108a) and (8.112) 


_h —h 
ct=rtet! =r 42 W eee (8.113) 


which is a quadratic equation. To get the roots we simplify the above equa- 
tion by setting h = 1, y =ct to the form: 


a? (1 — 6”) — 248 (x +1) + B? x(x +2) =0, (8.114) 


z= —7°B(x +1) x {-14 1-0-#) (1-45) - (8-115) 


If we set y = 0, corresponding to t = 0, we get two solutions: 


g=—y'8 x (-1+1) =0, 2778. (8.116) 


The first solution « = 0 corresponds to light originating from the emitter at 
t = —h/c (retarded location) and coming vertically down to the observer at 
t = 0. The second solution corresponds to light starting from the observer 
at a t = 0 and reaching the emitter at « = 2776 (advanced location) after 
time R/c. here R? = h? + 27287. We shall adopt first solution (i.e. with 
the + sign). 

Now we go back to the Doppler formula (8.109) and set 


cos @ = ithh=1 
= —::_.. WI1 — ; 
Jae + h2’ ; 
(8.117) 
pa 
fo 


Before we plot the frequency ratio as a function of time, we shall special- 
ize Eq. (8.109) for two specific cases: 9 = 7/2 corresponding to transverse 
Doppler effect, and 6 = 0, corresponding to longitudinal Doppler effect. 
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We collect the formulas for these two cases from (3.32) 
1 


Ttrans = 


? 


» 4 _ fice (8.118) 
ong (148) Y1+8 


Before plotting the frequency ratios for three values of the emitter velocity, 
namely 6 = 0.8,0.9,0.95, we shall obtain numerical values of the above 
ratios so that we can check the same values are obtained from the plots. 


Tirans = 0.6 for 8 =0.8, 0.435 for 6 = 0.9, 0.312 for 6 = 0.95, a 
: 8.119 
Tiong = 0.333 for 8 = 0.8, 0.229 for 6 = 0.9, 0.160 for 6 = 0.95. 
In Fig. 8.7, we have plotted the frequency ratios as a function of the obser- 
vation time. The above values are clearly reflected in the plots. 

We shall add a numerical “feel” of the results obtained, assuming the 
light is emitted by a sodium gas vapour lamp of which the wavelength is 


ae) 6 doplerD-2007 13.fig 
an iy " ; I I y(.8,x 
5 S ae 
> => 
1S) iS) 
= [| &s 
Ss) %© 
= = 
> > 
is) ins) 
~~ ~ 
Ay Ay 
3 254.5 -- 
me 
8 
= 
= 203.64 
S 
Z 
152.7 —- 
101.8 _L 
50.9 + 0 poeee-- 22-4 rs bennneennnd prcscoanees 
ct (meters) 
0 0 1 2 3 4 5 


Fig. 8.7. Frequency ratio as a function of observation time. 
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5890 A. The corresponding frequency is fo = 5.09 x 10'* Hz. The observed 
frequencies will then be f = I x5.09x 10!4 Hz =  x509 THz. The observed 
light falls in the Near Infra Red region, as shown in Fig. 8.7. 


8.10. Exercises for the Reader ITI 


R1 Consider elastic collision between two billiard balls A and B as in 
Fig. 8.3. The velocity of the incoming ball A is vin = tc. Let the scat- 
tering angle (i.e. of the ball A after the collision) be 6 = 30°. Find the 


following quantities: 


(a) The dynamic Lorentz factor of the incoming ball A. 

(b) The boost velocity cf (of the ZM frame with respect to the Lab frame), 
and the boost Lorentz factor y¥. 

(c) The bouncing angle ¢ of the ball B after the collision. 

(d) The 3-momenta pa-in, PA-out; PB-in, PB-out Of the balls A and B before 
and after the collision. 

(e) The total energies Ey-in, Ea-out, Ep-in, Ep-out of the balls A and B before 
and after the collision. 

(f) The En-Menta of the system of two balls before and after the collision. 

(g) The rest mass Mo of the system of two balls in the Lab frame S. 


Ans. (a) Pain = 3 


(d) 


(b) B=3,7= 4, (c) $= 524°, 
PA-in = ™,C (1.33, 0), 
Ppan = m,c(0, 0), 
(8.120) 
PaA-out = ™.¢ (0.92, 0.36), 


PB-out = ™,c (0.41, —0.36). 


(e) Ex-in = 1.67 m,c?, 
EB-in = MC’, 
(8.121) 
Ex-out = 1.46 M,C, 


Ep-out = 1.20m,c?. 
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(f) Pi, = 266 m,e(1, 0.5, 0), 
(8.122) 
PB out = 2.66m,c(1, 0.5 ,0). 


(g) Mo = 2.66 m.. 


R2 Consider the same scattering as in the previous exercise. Assume sym- 
metric scattering of the two balls after collision, so that ¢ = @. Let the 
dynamic Lorentz factor of the incoming particle be T. Show that the balls 
will bounce out with an angle O between them given by the formula: 


K 
= ——___ 12 
cos 9 cane (8.123) 


where K represents the kinetic energy of the incoming particle A. 


R3 A high-pressure sodium lamp contains vapourized sodium at a tem- 
perature of 2700 K. The sodium molecules are in random thermal motion 
with an average kinetic energy of $kT, where k = 1.38 x 10778 J/K = 
Boltzmann constant. A young scientist analyzes the spectral lines of the 
light emitted by the gas. Take the mass of a sodium molecule as m = 46 u. 
The wavelength of the sodium line is 5890 A: 


(a) Find the average velocity of a sodium molecule. 
(b) Find the broadening of a typical spectral line. 


Note that you will get the same answer if you apply the non-relativistic 
formula. 


Ans. (a) 5.7x 107%. (b) 0.065 A. 


R4 Consider the decay of a A-particle (cited earlier in the first problem in 
Sec. 4.11). Initially moving with a relativistic speed, it decays into particle 
#1 (proton) and # 2 (pion), leaving an angle @ between their tracks, as 
seen in a bubble chamber. Let P“, pi’, p$ represent the En-Menta of A, p 
and m7, respectively. Using conservation of En-Mentum and invariance of 
its norm, show that 


M?ct = mect + m3ct + 2E\ Ez — 2pip2c" cos 6, (8.124) 
where p; = |p*|, i = 1,2. 
[Hint: Simplify both sides of the equation: P“P,, = (pi + p$) (pip, + poy)-] 


Chapter 9 


Relativistic Rocket 


9.1. Introduction 


We shall present an example of how Minkowski’s equation of motion works 
by demonstrating its application on a relativistic rocket. A relativistic 
rocket, in principle and for all theoretical calculations, is the same familiar 
rocket that the students have studied in their mechanics books,* with the 
difference that the exhaust gas is ejected with a “relativistic speed” u and, 
as a consequence, the rocket accelerates to a relativistic speed in due time. 
What we call relativistic speed is roughly the range: c/3 < u < c, where c 
is the speed of light. Because of the relativistic velocities involved in this 
case Newtonian mechanics breaks down, and we have to use Minkowskian 
mechanics, in particular Minkowskian equation of motion (EoM). 

A relativistic rocket, i.e. a space-ship propelled by ejected gas to 
relativistic speed is not a reality [28]. However, one can still think of 
matter—antimatter annihilation rockets, and pion rockets for intellectual 
entertainment [29]. Even then the purpose behind our spending time on 
such an object is somewhat pedagogical. The exercises we are going to 
undertake are intended to sharpen ones understanding of Minkowskian 
equation of motion, employing 4-vectors. 

We have derived the mass equation for a relativistic rocket (see Eq. (9.8)) 
using momentum-energy conservation principles [30-33]. 


aSee [7, pp. 327-330]; [8, pp. 84-87, 144-147, 315-319, 328-332]. 
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Some features of this chapter that may kindle a special interest in a 
student or a teacher of special relativity are as follows: 


e We have subjected the two important equations derived in this chapter, 
namely, (a) the mass velocity equation (9.8), and (b) the EoM (9.29) to 
the N.R. test, by which we mean that all relativistic equations that have a 
non-relativistic (N.R.) analogue must converge to their N.R. counterparts 
when v <c. 

e Taking u as the ejection velocity of the emitted gas/radiation, we have 
obtained two special solutions of the EoM, corresponding to (i) wu = ¢/3 
and (ii) wu = c. We have plotted the velocity—time relation for both the 
cases, and shown that the plot for the case (i) closely follows the plot for 
the corresponding formula for v = v(t) obtained using N.R. (Newtonian) 
mechanics, up to vu ¥ 0.5c. 

e We have adopted a four-dimensional Minkowskian approach to obtain 
the EoM of the rocket, using 4-vectors, e.g. 4-velocity, 4-momentum, 
4-acceleration, 4-force. For this purpose, we have adopted a mathemati- 
cal formalism outlined by Moller [34]. 


Since the motion of the rocket will be one dimensional, confined to the 
x-direction, a typical 4-vector will have only t- and z- components and will 
: Y t x 
be written as A = (A’, A®). 


9.2. The Rocket, Its Specifications 


Let us now take a look at the rocket of our discussion. We have illustrated 
it in Fig. 9.1. It is moving along the X-axis with velocity v(t)m/s with 
respect to an inertial frame S$, which, for fixing the idea, we shall call the 
ground frame (GF). It is ejecting gas at a constant velocity —um/s and 
its rest mass at a constant rate r = the kg/s, relative to its instantaneous 


rest frame (IRF) S,(Q), thereby generating a reaction force (in this case 


a thrust force) ®. Our purpose is to find a formula for R, and then the 
equation of motion (EoM). 

Note that we have labelled the IRF with the extra tag (OQ) to stress that 
it coincides with the rocket frame R at the event “0”, which, for fixing the 
idea can be taken as “O: rocket passes a space station A”. Every IRF has 
to be associated with one, and only one, event “O”. 

Three quantities are specified for the assessing the performance of the 
rocket: u, r and M; = initial rest mass of the rocket at the instant t = 0, 
when it starts with zero velocity. In this chapter, M = M(O) will stand for 
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Oo x’ 


Z VA 
Fig. 9.1. The rocket. 


the instantaneous rest mass of the rocket at the event O. When written as 
a function of the “ground time” ¢, it will appear as M(t). 

Let us consider two infinitely close events 0, and Og (corresponding 
to the rocket passing two infinitely close space stations A and B on its 
path), the time-space coordinate differentials between them being (cot, dx) 
with respect to S, and (cdr, 0) with respect to S,. Between these events 
the rocket ejects a quantity of gas of rest mass duo. Consequently, its own 
velocity changes (i) from u(t) to v(t)+dv with respect to the GF S, (ii) from 
0 to dv’ with respect to S,, and (iii) its rest mass changes from M(t) 
to M(t) + 0M. Note that the time differential between the events being 
infinitesimally small, 67 is the proper time between the events. Also, note 


that, the rate of emission of the rest gas mass with respect to the rocket 


Apo 
dr 


In summary, the performance of the rocket is decided by three specifi- 
cations: (i) its initial mass Mj, (ii) the rate r = te. 
ejected from the rear end with respect to the IRF, (iii) the speed —u, with 
respect to the IRF, with which this rest mass is ejected. These quantities, 
being in the specification book supplied by the manufacturer, are frame 


independent and are to be taken as constants. 


frame is r = = lims_59 Se which is taken as a constant. 


at which rest mass is 


9.3. Review of the Non-relativistic Results 


We shall briefly review the non-relativistic (N.R.) rocket formulas so 
that we can compare the relativistic results with their N.R. counterparts. 
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We shall drop the subscript “,” from duo, because in the N.R. zone there is 
no such thing as proper mass. The N.R. formulas can be found in standard 
books on mechanics. We shall quote the following formulas [8]: 


OM 
v U Wr’ (9.1a) 
v=uln a8 > Or, “~~ =e a, (9.1b) 
M(v) Mi; 
T = thrust force = ru, (9.1c) 
du dM 
a es etd) 
dv . , 2» dv 
M(v) = =T=ru, or, using (9.1b) Mje7¥ re (9.1e) 
rt 
=-uln{1—— }. . 
v u In ( =) (9.1f) 


In (9.1b), M(v) is the same as M(t), since v = v(t). Equation (9.1d) in 
which du is the mass of the gas ejected in time dt, is a restatement of 
conservation of mass. The relationship between the velocity differential and 
mass differential shown in Eq. (9.1a) is a consequence of (i) conservation of 
mass, and (ii) conservation of linear momentum. The mass ratio equation 
(9.1b) is obtained by integrating the differentials in (9.la). Equation (9.1e) 
represents the EoM of the rocket. Equation (9.1f) gives the solution of the 
EoM, subject to the initial condition: v = 0 when t = 0. 


9.4. Relativistic Mass Equation 


We now have the following Lorentz factors, corresponding to the velocities 
to be used: 


g(u) = So T(v) = = T’(v’) = 1 


= 1-& fi-& 
As stated above, the rocket velocity changes from 0 to dv’, and it ejects a 
quantity of gas of rest mass dj1o from the event QO, to the event Og in the 
IRF S,. We shall write the components of the 4-momentum of the rocket 
at O, and Og, and of the gas ejected between these events — all of them 
in So. 

At this point, we emphasize once again that M = M(0) = M(t) = 
M(r) is the instantaneous rest mass of the rocket at the event © and hence, 
is a 4-scalar. 


(9.2) 
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The (t, 2)-components of the momentum 4-vectors we shall write below 
will follow from (8.17), in which we shall set m, = M(t). Also, note that 
the dynamic Lorentz factors are: g for the ejected gas, and I’ = 1 for 
the rocket, since v’ = 0, i.e. the rocket is momentarily at rest in S,. The 
4-vectors written below have only (t,)-components, and are valid in the 
IRF S,: 


AtQ,: PB =M(c,0). (9.3a) 
AtOg: B+65P =(M+6M)(c,5v’). (9.3b) 
5B = (5Mc, Mov’). (9.3¢) 
6p =dpog(c,—u). (9.3d) 


In the above oP stands for the change in 4-momentum of the rocket, and 
Op for the 4-momentum of the ejected gas, between the events O,4 and 
Og. We shall apply the conservation of 4-momentum in S,, using the data 
in Eqs. (9.3c) and (9.3d). 


SP = — 6p. (9.4a) 

From Egs. (9.3c) and (9.3d) t-comp : dM = —dpo g. (9.4b) 
z-comp : Médv' = duo gu. (9.4c) 

Hence, Mov! = —dM u. (9.4d) 


Note that (i) the rest mass lost by the rocket equals the relativistic mass 
gained by the ejected gas, according to (9.4b), (ii) Eq. (9.4d) is valid in 55. 
To validate it in S, we have to apply the velocity addition formula (4.6): 


dv’ dv’ 
vt do = Se © (de! +0) (1- m) wo4 (1-5) av’ (9.5) 


vu j 
dv =(1- a dv’. (9.6) 
This transforms Eq. (9.4d) 
ae sage (9.7) 
cs) “Mm 
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Integrating (9.7) from t = 0 to t = t, setting M = M; (i for “initial’”), 
and v = 0 at t= 0, we get 


Cc, Cc+v 1 M 
— ln —— = —-uln 
Gu M;’ 
M c—v\™ 1-—p Ta 
ee -— {= . 9.8 
o" 'M, (=) (5) (98) 


One of the requirements of all relativistic formulas is that they must 
converge to the corresponding N.R. counterparts (if such counterparts exist) 
in the N.R. limit v/c > 0. In this case, the N.R. mass formula is (9.1b). 
We shall show that this requirement is satisfied by the formula (9.8), using 
the definition of the Euler number: 


e @ lim (14 2)/*. (9.9) 
x0 


Proof. We set 6 =v/c. Then in the limit 6 > 0 
B-0 1 


1-B\* safe 1 
(5) 9 [+ B72) = [1 + B)B]-@/™. (9.10b) 
1+ 8 
Hence, (5) Es eo e/4) (9.10c) 
M639 (w/w) 
— ml 
or i, — (9.10d) 
same as the N.R. formula (9.1b) (QED) 


Is the formula (9.8) valid when u = c? To make sure, we shall retrace 
the steps from Eq. (9.3) downward, specializing them to u = c. Instead of 
assuming that the gas is ejected with velocity —u with respect to IRF So, 
we shall assume that, between the events 0,4 and Og, a beam of photons 
is emitted in the —2z-direction with energy dE, with respect to the IRF So. 
In this case, we use Eq. (8.26) for photon momentum: 


AtO4: PB = M(c,0). (9.11a) 
AtOp: P+5P =(M+5M)(c,5v’). (9.11b) 
5P = (5Mc, Mov’). (9.11e) 


6p = (S-) (9.11d) 


Cc Cc 
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In the above op stands for the 4-momentum of the emitted photon. 
We shall apply the conservation of 4-momentum in S,, using the data 
in Eqs. (9.11c) and (9.11d). 


5p =-87P. (9.12a) 
tcomp: 6M = a (9.12b) 
c 
z-comp: Mdv! = ahs (9.12c) 
c 
Hence, Méu' = —dM ce. (9.12d) 


It follows that Eq. (9.4d) will be valid for u = c. As a consequence (9.8) is 
also valid for u=c. We shall rewrite this for this special case: 
M c—v 1- 


i, =e z= = —— for a photon driven rocket. (9.13) 


9.5. The Thrust 4-Force 


The 4-momentum of the rocket at the event 0.4 can be written as PO A)= 
M(04)V(@a). We have used Eq. (8.17), replaced m, with M(O0,). The 
time difference between the events O04 and Oz is dt with respect to S, and 
67 with respect to the IRF S,. Differentiating Pp with respect to 7 we get 


dP me We dv diy 


= 9.14 
dr ( ) 
To evaluate 4“, we use (9.4b) 
dM dln | 
dr %dr 9” 
dito 
where r = : (9.15) 
dt 


To get a parallel formula for the photon-driven rocket, we refer to 
(9.12b), and get 


dM 1 dE, 
— = -— : 9.16 
dt c? dt ( ) 


We can combine the two equations into one, assuming that the rocket 


is ejecting relativistic mass, either in the form of matter or in the form of 
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radiation (we shall use the term radiation to mean photons), at the constant 
rate of o kg/s in its rest frame. 


do A fto 
For matter emission: @ = jim, a = = = gr. (9.17a) 
6E/c? 1 dE, 
For photon emission: @ = jim, ol = (9.17b) 


Note that r is constant by assumption, and g is constant because u is so. 
Hence @ is constant in (9.17a). We now assume that if photons are ejected 
to generate the reaction force, then 4 (9.17b). Then 


by (9.15) — (9.17) 


dM 
| eae aa constant (9.18) 
for both matter and radiation. 


We now go back to (9.14), and rewrite it as follows: 


ee GE pe 


where ® ae + ov (9.19b) 


(9.19a) 


is the “reaction 4-force”, or better still the thrust 4-force. However, we are 
using the symbol R instead of T’, because the latter symbol can be confused 
with time. 

In the following equations, we write the (t,x) components of the 
4-vectors in S>: 


Using (9.4a), (9.3d), (9.17a): id = = = — ike (c, —u) 
= —o(c, —u) (9.20a) 
From (8.8): V = (c,0), (9.20b) 


since v' = 0, IY = 1. 


Hence, from (9.19b): Re —o(c, —u) + o(c, 0) 


= 0(0,u) = (0, R). (9.20c) 
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In other words, the reaction 4-vector has the following components with 
respect to Sj: 


R = (RR) = (0,R) (9.21a) 
where R = ou = gru = reaction 3-force, with respect to S,. (9.21b) 


Note that the (¢,2)-components of the reaction 4-force R in S, are in 
agreement with (8.36). 

We shall now find the (t,2)-components of the reaction 4-force: 

= (R°,R*), in the ground frame S$, applying Lorentz transforma- 
tion Eq. (7.77a), corresponding to the boost: $,(—c8,0,0)S, to the (t, 2)- 
components of R in the IRF So, Shown in (9.21). 


Rt =T(R'+6R*%) =TBR. (9.22a) 
R® =T(R*+6R") =CR. (9.22b) 


Note that the (¢, 2)-components of the reaction 4-force R in S are in agree- 
ment with (8.36), which we rewrite in the present context as 


R= (RR) =P (=. R). (9.23) 


Referring back to Eq. (9.17) 


e For radiation emission R = oc = idee 


e For matter emission R = ou = gru = gT, where T = ru is the same 
thrust force of non-relativistic mechanics. See Eq. (9.1c). It changes to 
R= gT as it enters the relativistic domain. 


9.6. The Equation of Motion 


We return to Eq. (9.19a) and write the equation of motion 


ye R, (9.24) 
where M = M(Q) is the instantaneous rest mass of the rocket at the event 
O, and is a 4-scalar. All we now have to do is to write the z-component of 
the 4-vectors on either side of the equation, and simplify the same to obtain 
the acceleration a = x of the rocket in the GF S. We shall, however, find 
it convenient to work out the acceleration ag = ue in the IRF S,, and 
convert this acceleration to a using Eq. (8.46). 
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Consider the x-component of Vv using (8.8). The kinematic quantities 
in S, will be identified with “prime”. Then, 
dv _ dv’) _ det ad”, _ do! 


i a a ae a oa 


since v'= instantaneous velocity of the rocket in S, = 0. Consequently. 
i =1. 

From (9.21), the 2-component of R is R'* = R= ou. We thus get a 
simple looking equation of motion, which is valid in So. 


M(O)a, = ou = constant. (9.26) 


Mass x acceleration is constant. But mass is not constant. Hence, the 
acceleration in IRF S, is not constant. 

We shall write the EoM in the ground frame S, by converting a, > a, 
the acceleration in the ground frame S, using (8.16), which gives a, = T3a: 


M(@)T%a = ou = constant. (9.27a) 
d 
or, M(@) re = pu = constant. (9.27b) 


Now we rewrite Eq. (9.27b), using the mass equation (9.8): 


c-v\™_, dv ou 
r= = constant 2 
(: + “) ie. ee ee) 


where WM; is the initial (rest) mass of the rocket. 

We shall set 6 = v/c, and c/u = n in the index of the leftmost factor in 
Eq. (9.28). Here n > 1 is a positive real number greater than or equal to 1. 
n = 1 corresponds to u=c. On the other extreme n — oo would converge 
to the N.R. EoM shown in (9.le). We now simplify the left side: 


1 


"> GFP 


=[(1+4)a-,)*”, 
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The EoM (9.28) now takes a simpler form 


— dB ow 


— = k = constant. (9.29) 


(1+ 6) | dt cM; 


Let us rewrite the mass equation (9.8), setting v = cB; c/u =n: 


n 


M /1-8\? 
vi (5) . (9.30) 
We assume that the rocket has no payload, all its mass will ultimately be 
ejected out to provide the thrust. In other words, the rocket operates until 
M — 0, which happens when {8 — 1. 

We shall now show that the above EoM (9.29) will converge to the 
non-relativistic EoM as given in (9.le). We shall set 9 = gr as per (9.17a), 
8 — 0 and use the definition of Euler’s number e, as in (9.9). 


Proof. 


(1—6)"= | +9 1 1 
7 Aye => 7 Aes nes 
(Ay (hoy = | LOB) 
_ 1 _ 1 
(1+6)" (1+ 8)e/™ 
1 1 
a ey Ee 9.31 
((1 + B)V/B}v/u ev/u ( ) 
Substituting this in (9.29) we get: 
d 
Me" = ou = gru=ru, since g > l. 
Thus, we get back (9.1e). (QED) 


9.7. Solution of the EoM for Two Special Cases 


We shall find solution of the EoM for two special cases, namely (i) n = 
u/c = 3, and (ii) n = u/c = 1. The first case corresponds to the transition 
zone from non-relativistic to relativistic domain; the latter corresponds to 
a photon rocket. 


Example 1. Set n = 3, implying u = c/3. 


The reason for choosing n = 3 is two-fold: (1) the EoM shown in (9.29) 
will assume the simplest form, the numerator within the square brackets 
becoming 1; (2) we are now at the threshold of transition from the 
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non-relativistic (N.R.) to the relativistic domain, the I’-factor is very close 
to 1, in fact g = a = 1.06. Our results obtained here should be close 
to the N.R. results, so that we may feel comfortable that we are on right 
track. 

We specialize the EoM (9.29) for this special case: 


1 d 
oe “ a aT = k3 = constant. (9.32) 


Integration, subject to the initial condition: 6 = 0 when t = 0 leads to 
the following solution: 


1 
6= |p -)]. O<t<te. (9.33) 


The reader can verify the answer by differentiating @ with respect to t. 

We have written t. to mean “critical time”, when M — 0, as explained 
below Eq. (9.30). In other words, ¢t. is the time at “burn out”, assuming 
that the rocket has no payload, all its mass has been ultimately ejected out 
to provide the thrust. This happens when § > 1. From Eq. (9.33): 


1 
———_. - ]] =], 
Pree | 
1 3 
1-2kst-=- => kate =-, 
= oer ane 
gt. 9 
M; 8 
ote 


for the N.R. :—=1 
or the case M, 


(9.34) 


or 


We have plotted the velocity—time relation (rather the Gt relation) in 
Fig. 9.2(a), using Gnuplot. On the same graph, we have also plotted the 
N.R. equation (9.1f). 

We have set M; = 1lkg, r = 1kg/s. Setting u/c = 1/3 in the first 
of the equations in (9.2), we get g = 3//8 = 1.06. Hence @ (defined 
in Eq. (9.17a)) = gr = 1.06kg/m. Therefore kg = 347 = 0.3535. From 
Eq. (9.34) the critical time is t. = = 1.06s which has been set as 
the upper limit on the t axis. 


ee 
8x 0.3535 


The two plots, relativistic and non-relativistic almost coincide up to 
t+ 0.8s, 8 0.5. 
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Fig. 9.2. Case I, n = 3. Plots for (a) velocity vs. time; (b) acceleration vs. velocity. 


In Fig. 9.2(b), we have plotted “4 = k3(1 + 8)? = 0.3535(1 + 6). 
However, in this case we set the vertical axis to represent the independent 
variable 3, matching it with the vertical 6-axis of Fig. (9.2a). The hori- 
zontal axis, pointing to the left, represents the dependent variable ee We 
achieved this configuration by first plotting ap vs. 2 the usual way, then 
turning the plot anticlockwise by 90°. Our objective here has been to check 
whether the slope of the 6 — t-curve in Fig. 9.2(a) is corroborated by the 
measure of d3/dt in Fig. 9.2(b). In order to judge the correspondence, we 
marked four selected points on the curve (a) and their corresponding points 
on (b), wrote the values of a for these points on the upper horizontal axis 
of the plot box. Fair correspondence between these values in Fig (9.2b) and 
the corresponding slopes in 9.3(a) is discernible, suggesting that Eq. (9.33) 


is the solution of the EoM (9.32). 


Example 2. Set n = 1, implying u = c. 


This is the photon rocket mentioned in the Introduction. In this case, a 
jet of photons flowing out from the tail end of the rocket is serving as the 
propellant. We specialize the EoM (9.29) for this special case: 


1 dB @ _, | 
ee ao k, = constant. (9.35) 
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Integrating from t = 0 when 6 =0 tot=t;8=£6 


B dp - 
f weno (9.36) 
we get 
B inp... /# 
+B) + 1-6 kt = (56) « 
or 
(28) | Pata PT 
‘= (*) mire te 18" (9.37) 


We have plotted the velocity-time relation (rather the G-¢ relation) in 
Fig. 9.3(a). However, in Eq. (9.37) t is a function of $. Hence, using 
Gnuplot we first obtained 6 as the horizontal axis and t as the vertical 
axis. In order to reverse their roles, we transformed the plot by (i) a rota- 
tion through 90° in the anticlockwise direction, followed by (ii) a reflection 
about the vertical axis (i.e. about the new (-axis). 


2a = E20 2 

ied: So So s 

rt . . = 
. 
3 _ aaa |? 
$ t Sf = 
3 7 a =|2 
: ra pS — ha 

~ oo 


d B/dt 
0 
el 
J 


a = % 2 
- co Co 


(b) 


0.4 


acceleration 


Fig. 9.3. Case Il, n = 1. Plots for (a) velocity vs. time; (b) acceleration vs. velocity. 
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In Fig. 9.3(b), we have plotted a (its axis pointing left) vs. 8 (its axis 
pointing upward). The procedure, objective, and explanations are the same 
as in Example 1. 

It should be noted that in this case g is given by (9.17b), which we 
rewrite and interpret as follows: 


1dE, 1 as 
> = — x {radiative power emitted from the tail of the rocket}. 
cdr ¢ 


(9.38) 


O= 


For the plottings we have taken M; = 1kg, 0 = 1kg/s. 

How long does the rocket operate? Until 6 — 1, as mentioned below 
Eq. (9.30), and therefore, by (9.37), until t > oo. 

It is seen from the plot in Fig. 9.3(a) that 8 approaches unity (or, v 
approaches c) asymptotically. 
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Chapter 10 


Magnetism as a Relativistic 
Effect 


10.1. Velocity-Dependent Force from 
a Velocity-Independent One under 
a Lorentz Transformation 


We shall go back to Lorentz transformation of 4-force outlined in Sec. 8.7.4 
and reconsider how a force F’ on a particle at a particular event in S$’ will 
transform into a force F in S at the same event under the boost $(6, 0,0)", 
so that B = Gi. Specialize the force transformation formula (8.56) to this 
boost. 
+ B(F’-v’). re OM pe FE 

i ee, Co) eee ee 
Note that 

Bv'-F B 


. / 
F, + B(F’-v') = F, + ——— = F, (1+ Bu;,/c) + — 
Cc Cc 


(10.1) 


(Ey + Fu). 


(10.2) 
Therefore, 
Bf Flu, + Fy, 
F, a F’ a yoy 
at c 1+ Bvt, 
ye fw, +8 
Fy = yF,- ars (; + Bl Fy) . (10.3) 
B fv, teB 
Poe fF tire a F'! 
y z C 1 mie Bu, z 
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Hence it follows from Eq. (10.4) and the velocity addition formula 
(4.23b) that 


ae eee 2 Rn, + Flvz) 
Fy =F, - ey Fy, (10.4) 
a VF! — te 

iG 


It will be a simple exercise to show that 
v Xx (ix F’) = (uF, + u2F,)i — v2(Fyj+ Fk). (10.5) 
We can now write the force whose components are given in Eq. (10.4) 
as 
F = Fli+7(F/j+ Fok) + Ziv x (3x F’)). (10.6) 
We also break up the force F’ into two components, viz., a component 
Fy which is parallel to the boost velocity cB and a component F, which 
is perpendicular to c@, so that, F’ = F + F",, and so that for the special 
boost $(6,0,0)S", 
fl = Fi; FP. = Fjj+ Fk. (10.7) 
The general form for the force transformation formula, valid for a general 
boost $(8)5S", now follows from Eq. (10.6): 


P=F, +1 ¥G, (10.8a) 
where, Fy = Fi + 9F"',, (10.8b) 
and, G = 1(g x F’). (10.8¢) 


It is then seen from Eq. (10.8) that the transformed force F has a 
component F, which is velocity independent if F’ is so, and a part v x 
G which is explicitly velocity dependent provided G is not zero. We are 
therefore led to the following theorem: 


Theorem 10.1. Suppose a particle P moving in an arbitrary trajectory 
experiences a purely velocity-independent force F’ as measured in an inertial 
frame of reference S’. Suppose this frame S' is moving with velocity cB with 
respect to another inertial frame S, and that the velocity of the particle 
at some instant t is v in S. Then the force F, as measured in S at the 
time t, will be velocity dependent, being the sum of a velocity-independent 
component Fy and a purely velocity-dependent component v x G as given 


by Eq. (10.8). 
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10.2. How Magnetic Force Originates from Lorentz 
Transformation 


The starting point of the principles of electromagnetism has of two parts, 
namely (1) the Lorentz force equation which defines the electric field E 
and the magnetic field B in terms of the force F that a charged particle q 
will experience under the influence of a distribution of electric charges and 
currents, and (2) Maxwell’s equations. Using these two sets of equations, 
one can understand and explain every phenomenon in the domain of elec- 
tromagnetism, including attraction and repulsion between electric charges, 
electric currents, magnets, operations of motors and generators, as well as 
the propagation of electromagnetic waves. 

Let us now consider a distribution of charge Q which is static in an 
inertial frame S’. Only an electrostatic field E’ exists in this frame, so that 
the force experienced by a moving test particle of charge gq is the velocity- 
independent force F’ = qE’. If the distribution Q is in bulk motion with 
constant velocity u = c@ as seen from another frame S, then S and S’ 
must be related to each other by the boost: $()S’. It now follows from 
the above theorem, in particular Eq. (10.8), that the same particle will be 
seen to experience a velocity-dependent force F in S, which is given by the 


formula: 
F = q(E+v xB) (10.9a) 
where E= E) + 7E‘, (10.9b) 
x ’ 1 
and B= —-(8x E’)=-8~xE, (10.9c) 
c c 
since 3 x BE; = 0. Line (a) in the above equation is the Lorentz force 


equation. Lines (b) and (c) show how a pure electric field E’ in the frame 
S’ transforms into a combination of electric field E and magnetic field B 
in the frame S. Hence the main conclusion of this chapter. 


Conclusion: A distribution of charges Q, when moving with uniform 
velocity u = cB, will create, in addition to an electric field E, also a mag- 
netic field B. The emergence of the resulting magnetic force can be linked 
to the Lorentz transformation of contravariant 4-vectors which itself is a 
consequence of the postulates of special relativity. In this sense magnetism 
is a relativistic effect. 


Note that the above exercise does not shed any light on what happens 
when the charge distribution Q moves with an arbitrary (non-uniform) 
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velocity. The field resulting from such a motion can be worked using the 
full set of Maxwell’s equations. 

We shall come back to Eq. (10.9) in Sec. 11.4 through Eq. (11.37), and 
in Sec. 11.6 through Eq. (11.63b) — by a different route, namely, Lorentz 
transformation of the electromagnetic field tensor. 

It may be appropriate to close this discussion with a quotation from 
Leigh Page.* “The rotating armature of every generator and every motor 
in this age of electricity are steadily proclaiming the truth of the relativity 
theory to all those who have ears to hear.” 


@L. Page, Lecture at December 17, 1941 meeting of the American Institute of Electrical 
Engineers, New York. Quoted at the beginning of Chapter 3 of Ref. [5]. 


Chapter 11 


Principle of Covariance with 
Application in Classical 
Electrodynamics 


11.1. The Principle 
Physics is geometry. Consider Newton’s second law of motion: 


dp 
= (11.1) 


The left-hand side is a “prescribed vector” — implying thereby a straight 
line segment of pre-specified “length” and “direction”. The right-hand side 


“parallel trans- 


is a “constructed” vector, obtained through operations like 
port”, drawing a “vector triangle” and division of one of its side with a 
3-scalar dt, as illustrated in Fig. 11.1. 

A particle, while in motion, traces out a certain path T. Two nearby 
locations A and B on this path are reached by the particle at times t 
and t + dt. Its 3-momenta at these two instants are p(t) and p(t + dt), 
respectively. The average force F between these two instants, multiplied 
by the time dt is (approximately) equal to dp, as illustrated in the upper 
box. 

The ratio of the change in the momentum vector dp to the time differ- 
ence ot is the average force F acting on the particle between A and B. This 
average force becomes the instantaneous force F at the instant t if we make 
the time difference infinitesimally small, i.e. when dt > dt. 

The process employed in this example epitomize the structure of most 
physical laws. The Lord of the universe conceived physical quantities of 
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Fig. 11.1. Graphical construction of Newton’s second Law of motion. 


nature as “geometrical objects” and ordained them to shape through rules 
of geometrical constructions into different, but all the same, geometrical 
entities. 

Even though the geometrical picture may not be overtly manifest in 
the equations of physics, geometrical concepts and constructions are sub- 
tly at work behind the structure of physical laws. This is because scalars, 
vectors and tensors are all geometrical objects, as we highlighted at the 
end of Sec. 5.1.1, and illustrated in Fig. 5.1. The shape of every physi- 
cal equation — which either establishes a law — or proclaims a certain 
rule, convention, definition or relation — must be such that the geomet- 
rical objects on either side of the equation have come down to the same 
rank through processes of geometrical construction. This, in nutshell, is 
the Principle of Covariance. 

Apparently then this principle is as old as Newtonian Mechanics. How- 
ever, the advent of relativity adds extra dimension to this principle by 
reminding us that the physical quantities are no longer geometrical objects 
of the visible three-dimensional world, but are inhabitants of the four- 
dimensional space-time. 

The Principle of Covariance declares that the mathematical expressions 
of all physical laws must be written in such a way that either side of the 
equation is a 4-tensor of the same rank and same sequence of contravariant 
and covariant indices. When an equation satisfies this demand, we say that 
it is covariant. By corollary, a law or a relationship which cannot be written 
covariantly, must have a limited application, spatially or temporally, and 
cannot be regarded as a universal law. 

The law of motion written in the forms (8.28) and (8.33) are two exam- 
ples of covariant equation. 
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In this chapter, we shall recast the familiar equations of classical elec- 
trodynamics, the equation of continuity, Maxwell’s equations, the electro- 
magnetic energy-momentum conservation laws in the covariant forms. 


11.2. The Flux of a Vector Field in E° 


11.2.1. 2D surface embedded in 3D space, and the 
outward normal 


Figure 11.2(a) shows a two-dimensional surface S in the three-dimensional 
Euclidean space. The equation for such a surface is usually represented by 
a mathematical equation in the form 


Implicit form UV = U(x,y,z) =C, C =constant. (11.2) 
11.2 
Parametric form « = f(u,v), y=g(u,v), z= h(u,v). 


In the implicit form, it is not possible to tell from the equation which one 
is the independent variable, because all variables are treated as equal. In 
the parametric form, a point (2,y,z) on the surface S is determined by 
two parameters, suggesting that the surface is a two-dimensional object, 
embedded in the three-dimensional space spanned by the (X,Y, Z)-axes. 
The simplest example of both can be provided by the equation of the 
surface of a sphere of radius R, and having the centre at the origin, shown 


< 


o = constant 


x 


(a) 


Fig. 11.2. (a) Surface and its normal; (b) a spherical surface shown with unit vectors in 
the spherical coordinate system. 
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in Fig. 11.2(b). 


Implicit form U = x? + y? + 2? = R?. 
11.3 
Parametric form « = R sin@ cos¢, y= R sind sing, z = R cos#. 


In the second case, the spherical coordinates (0, 6) serve as the parameters 
(u,v). 

Surface integration of a vector field involves the unit normal vector n 
drawn at each point (x,y,z) on the surface S. The gradient vector VU 
points in the direction of the outward normal (i.e. in the direction in which 


it is increasing). Hence 
_ { V¥(r) 
n= (EET) an at 


is the outward normal at a point r = ro on the surface. As an example, the 
outward normal drawn at a point ro on the spherical surface S of Eq. (11.3) 
is given by the following expression: 


ate = (“etek Solin, (11.5) 


where (e,,e9,e4) are the three unit vectors on the surface, associated 
with the spherical coordinate system, being in the directions of increas- 
ing r,0,@, respectively (indicated by the curves C;..Cg,C4, respectively). 
Of these three, e, is identified with n, the normal vector, the other two 
being tangent vectors to the surface. 


11.2.2. Surface integral, flux of a vector field 


Consider a vector field F (r), “flowing” through an open surface S which 
is “oriented” (i.e. its unit normal at each point is uniquely defined). This 
surface S can be divided into a very large number N of small patches 
6a1,0a2,...,dan, the largest of them having an area of 6a; i.e. day < da; 
1 <i < N. Consider one such patch da; whose centre is located at the 
coordinates r;. Let the unit normal vector on this patch be n;. The vector 
field at r; is F(r;). Then the surface integral of the vector field F (r) over 
S, represented by the symbol yp is defined as the limit of a sum, as follows: 


qo 
i [f Fe a ae y BI (ri) “Ny Oai, (11.6) 


and is called the flux of the vector field F (rv) across the surface S. 
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We shall not elaborate on this further, but illustrate the surface inte- 
gration with a familiar example, taking the vector field F(r) to be the 
electric field E(r) emanating from a point charge Q sitting at the origin. 
The surface S is the upper hemisphere with the centre at the origin, i.e. 
the surface of Eq. (11.3), expressed in the spherical coordinate system as 
follows: r= R;0<0<7/2;0<¢< 2n. 

In this case a ) = E(r) = — +e,, and da = r? sind d0d¢e,, so 
that E-da = sin 60 dO do. Hence, 


— 


Q 


TTEQ 


n= (11.7) 


= 


nm /2 
oe sin 6 dé dé = 
i 2 


is the flux of the E (r) field across the hemisphere. 
The flux of a vector field is closely associated with Gauss’s divergence 
theorem, which we state here as follows: 


Theorem 11.1. Let F (r) be a vector field which is continuous, along with 
continuous derivatives in a region of space R. Let S be a closed piecewise 
smooth surface in R, forming the boundary of the volume V within. Let 
n(r) be a unit outwardly directed normal vector on S at a point r on S. 
Then 


|| gee dr=[f ve )-nda= [f Fee da. (11.8) 


11.2.3. Continuity equation 


Let us consider a fluid in streamline motion* as in Fig. 11.3. This fluid 
is characterized by a velocity field u(r,t) and a fluid mass density o(r, t), 
both of which, in general, are unsteady fields, i.e. functions of t as well. 

In pre-relativistic physics mass is conserved. There is an equation, some- 
times called the equation of continuity, which expresses this conservation 
rule, namely, 


—+V-(ou) = : (11.9) 


Note that the product ou of fluid mass density and fluid velocity is the 
fluid flux density. 

Conservation equitations of various physical quantities have a structure 
which is similar to Eq. (11.9). Since conservation equations play a very 


®We imagine the fluid particles to be moving together without random thermal motion. 
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Fig. 11.3. Illustrating mass conservation. 


important role in physics as well as in the remaining part of this chapter, 
we shall demonstrate how the above equation is derived. 

Consider the imaginary closed surface S, fixed in space, and embedded 
inside a stream of fluid. Since fluid masses flow in and flow out of the space 
V inside S, the content of fluid mass in it is a function of time. We write 


it as follows: 
M(t) = ie a(r, t)d?r. (11.10) 


The change of mass over time dt is 


dM =fff, a(et+ andr fff o(r, t)d®r = (// a) dt. 


(11.11) 


Conservation of a physical quantity means that if the content of this 
quantity inside a fixed volume V has increased by a certain amount over 
a given time, the same amount must have flowed in through the boundary 
surface S in this time. Applied to mass conservation, it means: 


- (f/f low.sate.0) nda) dt 
: (///, Vv. ote. ule) dt. (11.12) 


We have used Gauss’s divergence theorem to convert the first equality to 
the second one. The minus sign appears because the surface integral, if 


dM 
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positive, would mean an outflow (the unit vector n is an outward normal). 
Equating the right-hand sides of the above two equations, we get 


ik (20: ae 2 a(e, u(r) ay dt = 0. (11,13) 


Since the result is valid for arbitrary V and arbitrary dt, the integrand is 
zero. Hence Eq. (11.9) follows. (QED) 
All conservations laws have the same format as in Eq. (11.9), namely 


S (volume density) + V - (flux density) = 0. (11.14) 


We have written “volume density” in italics, and “flux density” in bold, 
to indicate that the former is a scalar, and the latter a vector. 

How shall we write the continuity equation covariantly, i.e. in the con- 
text of relativity? By (1) making sure that the quantity in question is 
conserved, e.g. energy of a stream of particles flowing like a fluid, the elec- 
tric charge contained in a charged fluid, as in Sec. 11.3; (2) multiplying the 
numerator and the denominator of the first term in (11.14) with c. Let o 
(pronounced varrho) represent the volume density, and 7 the correspond- 
ing flux density of this quantity (e.g. energy, charge). By definition, the 
flow of the quantity per unit time across the surface element da = nda is 
given by the relation 


dp = J - da. (11.15) 


We now rewrite the continuity equation (11.14) in the following more precise 
language: 


d (co) 
O(ct) 


+V-F=0. (11.16) 


Written covariantly, as in Eq. (11.29) 


ViT" (x) =0, 


(11.17) 
where J" = (co, 7) 


represent the time and space components of the 4-vector density F of the 
assumed conserved quantity. 
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11.3. Conservation of Electric Charge 


There are only two types of force which can be understood in the “classi- 
cal language”, i.e. without using quantum mechanics. One of them is the 
force of gravity — the most commonly and universally experienced force 
of nature. This force, however, comes under the purview of General Rel- 
ativity. Moreover, this force is too weak compared to the electromagnetic 
forces to have any effect at all on the motion of subatomic particles which 
can be in relativistic motion at the laboratories. 

This leaves us with only one force, namely the electromagnetic force, 
which has a classical structure. We shall recast this classical structure of 
electromagnetic theory into a covariant form in order to illustrate the lan- 
guage of covariance. In the discussions to follow we shall not present any 
detailed discussion or derivation of the formulas, for which the reader has 
to look into standard books on electrodynamics. 

We shall start with the basic postulates of electrodynamics and express 
them first in the “classical” language and then in the covariant format. 


Postulate I. q is the electric charge of a particle, moving or stationary, then 
the measure of q is the same in all inertial frames. In other words, q is a 
4-scalar. 


Postulate IT. Electric charge is conserved. This charge conservation law is 
customarily expressed in the form of the continuity equation (Sec. 11.2.3): 


dp(r, t) 
Ot 


+V-J(r,t) =0. (11.18) 


In the above equation, p(r,t) and J(r,t) represent, respectively, the charge 
density and the charge current density at the event O = (ct,r) = (a), as 
measured in a given frame of reference S, which for explicitness we shall 
call the Lab frame. 

If the electric current distribution is due to a “streamline motion” of an 
electrically charge fluid, as shown in Fig. 11.4, and if u(r, t) is the stream’s 
3-velocity (in the Lab frame) at the event (x), then 


J(r,t) = p(r,t) u(r, t). (11.19) 


We can think of a comoving frame of reference S, moving with the 
charge stream at the event (x). The charge density po(r,t) = po(x) at (x), 
measured in the frame S,, will be called the proper charge density of the 
fluid at (a), and will be treated as a 4-scalar density. The dynamic Lorentz 
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Zo 
Xx 
Z 
Fig. 11.4. Lab frame S and comoving frame So in a streamline flow of particles. 
factor of the fluid at (a) is 
1 
T(x) = (11.20) 


Imagine a collection of particles containing a quantity of charge dq, 
enclosed within the boundaries of a box whose volume is 6V in S' and 6V, 
in S,, so that 


Og = Po(x)dV, = p(a)doV. (11.21) 


Lorentz contraction of the dimension of this box along the direction of u, 
changes its proper volume dV, to the laboratory volume 


1 
6V = Te (11.22) 
By (11.21) and (12.3) 
p(t) =T(@)p0(a). (11.23) 


The 4-velocity of the charged fluid at (x) is, according to (8.7), 
U*" (x) =T(2)(c, u(2)). (11.24) 
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Therefore, we define the 4-current density of the electric charge fluid as 
J" (a) = po(x%)U* (a). (11.25) 
From (11.25), (11.24) and (11.23) 
J" (a) = po(x)T(2)(c, u(x) 
p(x)c, p(x)u(x)) (11.26) 
p(a)e, I(x)). 


Thus, p(x) times c constitutes the time component, and J(x) constitutes the 
three space components of J“(x). Let us rewrite the continuity equation 
(11.18) as 


Hep) , Ode, Oly , Ad 
O(ct) Ox Oy Oz 


=0. (11.27) 


The 4-gradient operators V,,,V“ were introduced and elaborated through 
Egs. (7.72), (7.113). We rewrite the first one explicitly as follows: 


_d@ (8 8 0 a\ (a 
Vu = age = (Sars) = (av): eee) 


The charge continuity equation (11.18), rewritten in (11.27), now assumes 
the form: 


Vt (a) =0. (11.29) 


The left-hand side of the equation appears like a contraction, reducing it toa 
4-scalar. The right-hand side is also a 4-scalar, having a single component 0. 
Hence, it is a covariant equation. 


11.4. The Electromagnetic Field Tensor 
Let us make the third postulate of electrodynamics. 


Postulate III. The force experienced by a particle carrying an electric 
charge q is a velocity-dependent force, called Lorentz force (see Eq. 10.12), 
written as 


F =q(E+v xB), (11.30) 


where v is the velocity of the charged particle at the event point (a). The 
above equation also serves as the definition of the electric field EK and the 
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magnetic field B at the location of the particle. Let us write the dynamic 
Lorentz factor for the particle’s velocity: 


= (11.31) 
Substituting the Lorentz force (11.30) in (8.36), the time and space 


components of the corresponding Minkowski force J" are now obtained 
compactly as 


1 
F =} 9" = or (2B-v, B+vxB), (11.32) 
c 
and in an expanded form as 
T T 
foo p.eyi Ezvz + Eyvy + Ezvz), 
c c 
1 qt 
F'=TF, = —(£,c+cBzv, — cByvz), 
‘ (11.33) 
2 qt 
F =F, = —(L£,c+cBzvz —cBzvz), 
c 
3 qv 
F=TF, = —(E,c+cByvz —cBzvy). 
c 


The above equation tells us that the Minkowski 4-force acting on a charged 
particle q is a linear function of its 4-velocity, and therefore can be written as 


gu = Tpwy, (11.34) 
Cc 


In the above equation, V, is the covariant form of the 4-velocity vector V", 
defined in (8.7) and is obtained from it by the lowering operation, 


Vi = 9uwV" = (Tc, -Iv) = (Uc, -Tvz, -Tvy, -Tvz). (11.35) 


F#”. as defined by (11.34), is a very important contravariant tensor of 
rank 2, called electromagnetic field tensor, and must have the following 
components as suggested by Eq. (11.33): 


0 1 2 3 

0 -E, —E, —E, 

E, 0 —cB, cB, pvr — — fey, (11.36) 
Ey, cB, 0 —cB, |’ 

E, —cBy, cBz 0 


0 
fey = 1 
2 
3 
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The electromagnetic field tensor as written in (11.36) is an antisymmet- 
ric contravariant 4-tensor. Hence, its diagonal elements are all zero, and 
the off-diagonal elements on the upper side of the diagonal are equal and 
opposite to the corresponding off-diagonal elements on the lower side of the 
diagonal. Hence, it has only six independent components, and they are the 
six 3-scalar components of the (E, B) field. 

Equation (11.34) is the covariant expression of the Lorentz force 
equation (11.30). Using the Lorentz transformation formula (7.80) for a 
contravariant tensor, the reader should be able to obtain the following 
transformation rule for the components of the (E, B) field under the boost: 
S(8,0,0)S’: 


El = E,, ae =. 
EF, = (Ey — BcBr), cBy = (cBy + BEz), (11.37) 
EB, = (Ez + BcBy), cB, =7(cBz — BEy). 


To illustrate the procedure, we shall work out two of the above transfor- 
mation formulas in details, namely for E/, and B!, using the Lorentz matrix 
as given in (7.69), to transform the components of the tensor (11.36): 

Ee ae a (7 SOF? = 01,0°, Fo T 01,0 F' 
=7(1 — B?)E, = Ep. 
cB, = FF?) = 07,01 FOP _ 09h + 07,01, F?! = 7(cB, — BEz). 
(11.38) 


Note that even though each of the two lines above apparently involve a sum 
of 16 terms, corresponding to a, 8 = 0,1, 2,3 we have accommodated only 
the non-zero terms, which happen to be only two in number in each case. 


11.5. The Field Equations of Electrodynamics 
in the Covariant Language 


We shall now establish covariance of the field equations of electrodynamics. 
These equations, expressed in their more familiar non-covariant form, are 
known as Mazwell’s equations. They separate out into two sets of 1+3 
equations, namely (1) the inhomogeneous equations containing the source 
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terms, and (2) the homogeneous equations without the source terms: 


1 
V-E(r,t) = ach), (11.39a) 
Cc 
Inhomogeneous part: : 


OE (r,t) 1 
V x cB (r, t) = es = ae (11.39b) 
V -cB(r,t) =0, (11.39c) 
Homogeneous part: B 
V x E(r,t)+ eB et) =i (11.39d) 


To convert the inhomogeneous part, given by (11.39a) and (11.39b), we 
first expand them into four separate equations in terms of the Cartesian 
components of E,cB, J. Identifying these terms in the expanded expression 
as the components of J“(x) and F“” (a), with the help of (11.26) and (11.36) 
it should be easy to see that these two 1+3 equations fuse into a single 
4-equation, i.e. one covariant equation: 


VoF(x) = —J"(a), (11.40) 


We shall demonstrate explicitly how (11.39a) shapes into the p = 0 
component of (11.40): 


OE, . OBy , OB 


Left side = js = V F10 F20 F30 
eft side Da Dy De 1 + V2 + V3 
= Var: 
= 5 1 0 0 1 0 
Right side = — J”. Hence, VgF¥) = —J”. (QED) 
Ege Ege 


With the above hint the reader should be able to demonstrate equivalence 
between the x,y, z components of (11.39b) and the pw = 1,2,3 components 
of (11.40). 

Now we shall convert Eqs. (11.39c) and (11.39d) into covariant form. 
For this purpose, we shall obtain from F” its dual §"” by the following 
operation: 


FY" (a) = sot”? Fup (2). (11.41) 
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Here Fg is the covariant tensor obtained from F'“” by lowering both indices 
(Reader, confirm it): 


0 Ek Hk E, 
—E, 0 -—cB, cB, 
—E, cB, 0 —-cB, 
—E,—cBy cB, 0 


Fag => Jopg eve” — (11.42) 


and e#”° is the Levi-Civita symbol. We had defined this as a 3-symbol in 
Sec. 5.1.3, under Eq. (5.13). We now extend the same symbol to a 4-symbol, 
defined as 


0 if any two indices equal, 


ehvaB — 2 7 if upva8 = 0123, or any even permutation of 0123, (11.43) 
—1 if pve is any odd permutation of 0123. 


For example, 1023 and 1203 are obtained from a single (i.e. odd) permu- 
tation and from two (i.e. even) permutations of 0123. The reader should 
prove the following important identity: 


OOo eer See, (11.44) 


where Q represents a proper Lorentz transformation matrix. As a conse- 
quence, we can regard the Levi-Civita symbol to be a contravariant tensor of 
rank 4, which transforms into itself under a proper Lorentz transformation. 
There is only one other tensor, namely, the metric tensor g,, which has 
a similar property. Going back to (11.41) and (11.42), the reader should 
work out the components of §/”, and show that 


0 1 2 3 
0 —cB, —cB, —cB, 
cB, 0 E, —kEy (11.45) 


It should now be a simple exercise to show that Eqs. (11.39c) and 
(11.39d) can be written covariantly as 


Vas =0. (11.46) 
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We shall demonstrate explicitly for one component, say p = 3. 


Vad = Voe? + Vi88 + Vad"? + V3ae"3 
O ) 
= aa By (E) 


_ OE, OE, | OcBz\ _ OcB 
--(3- Oy + oe) = |v xB oo 


a) 
—cBy) + 5-(-Ey) + 


= 0. (QED) 
x 

In summary, Maxwell’s equations written in 1+3 forms in (11.39) reduce 
to two covariant equations: 


1 
Inhomogeneous part: Va F(x) = —J"(x). (a) 


Egc (11.47) 
Homogeneous part: Voes°'"(x) = 0. (b) 


There is one more way to express the homogeneous equation, namely, 
VERY LV” PIE 1X71 FRY = 0), (11.48) 


Note that there is a cyclic permutation of the three indices u,v, in 
the above equation. Also, due to antisymmetry of F'“"” the left-handside 
is identically zero, unless y4,v,n are all different. This equation there- 
fore represents only four equations, namely corresponding to (u,v,7) = 
(0,1, 2), (0, 2,3), (0, 3,1), (1, 2,3). These four equations correspond to the 
1+8 equations represented by (11.39c) and (11.39d). The four components 
of the operator V" are shown in (7.113b). 

We shall verify equivalence between (11.48) and (11.39c) and (11.39d) 
for one combination, namely vj = 012, leaving to the reader verification 
for the other three combinations: 


Vor? 4 Vi p20 4 72 pol = O(—cB:) _ OF y | O(-Ex) 


Oct Ox Oy 
OcB 
--|vxE+ aa | =0. (QED) 


Therefore, Maxwell’s equations written in 1+3 forms in (11.39) can be 
expressed covariantly in the second way: 


1 
Ec 
Homogeneous part: V4 FY’ 4+ V" PMH + V1 FHY = 0, 


Inhomogeneous part: Vo F(x) = 


J*(x). 
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There is a third way, perhaps a more powerful way, of expressing 
Maxwell’s equations, namely, by writing them in terms of potentials. The 
homogeneous equations, i.e. Eqs. (11.39c) and (11.39d), define the scalar 
potential ®(r,t) and the vector potential A(r,¢t) through the relations: 


cB (r,t) = V x cA(r,t), (11.50a) 
_ OcA(r, t) 


E (r, t) = —V r,t) cot ) 


(11.50b) 


because if we write (E,B) in the form of (11.50), the homogeneous part 
(11.39c) and (11.39d) of Maxwell’s equations will be identically satisfied. 
Adopting Lorentz gauge: 
O(r,t)/c _ 


: __————_ = 11.51 
V- Art) + 0, (11.51) 


the source equations now reduce to the following inhomogeneous wave 


equations: 
V5-- Oa (11.52a) 
Ss == 52a 
c2 Ot? aoe 
107A 1 
VA- => = —J. 11.52b 
c? Ot? Egc ( ) 


The set of Eqs. (11.50)—(11.52) constitute the potential form of 
Maxwell’s equations (11.39). Together they are equivalent to the complete 
set of Maxwell’s equations (11.39). We shall convert each of them into a 
covariant equation. 

For this, we first define the 4-potential A” to be a contravariant 4-vector 
with space and time components: 


AM(a) = (2810). A) (11.53) 


and the d’Alembertian operator 0”, obtained by taking the “scalar prod- 
uct” of the operator V,, with itself, as shown in (7.112c). 

It is now obvious that the two 3-vector equations (11.50), defining the 
scalar and vector potentials, are equal to the following single “covariant 
equation”, obtained by replacing the components of (E,cB) and (®/c, A) 
by the corresponding components of F#” and A*, identified from (11.36) 
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and (11.53), respectively: 
FY’ (2) = c[V" A" (a) — V’ A" (a)). (11.54) 


We shall demonstrate this equivalence for one specific example, namely, 
pv = 10. 


Right side = c[V'A°—V° A] = ———— =f.=F", (QED) 


Also, the Lorentz gauge condition (11.51) is clearly seen to be equivalent 
to the following covariant equation: 


V,A"(x) =0. (11.55) 


Finally, the 1+3 inhomogeneous wave equations for the scalar and vec- 
tor potentials, namely Eq. (11.52), now become one single covariant wave 
equation for the 4-potential: 


1 
EgCc? 


*AM(@) = 


J# (a). (11.56) 


In summary, we have the Potential form of Maxwell’s equations: 


4-Pot defined: FH’ (x) = c[V" A’ (ax) — VY A" (a))]. 


Lorentz Gauge: V,A"(x) = 0. (11.57) 


1 
Wave equation: 0? A¥ (x) = — 3" (x). 
Ege 


We can supplement these equations with two more equations which 
represent the only invariants of the electromagnetic fields (i.e. they retain 
their values under a Lorentz transformation): 


FF Fg = 2(c° B* — E”), (11.58a) 
F°° 3,3 = 2cB-E. (11.58b) 
This exercise is best left to the reader. There are some interesting con- 


sequences of the above invariants, which the reader will explore through 
some examples in the problem set. 
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Before leaving this chapter, we remind the reader the covariant expres- 
sions of the equations of Classical Electrodynamics, laid down in the form of 


(1) Continuity equation: (11.29); 

(2) Minkowski force (corresponding the Lorentz force equation: (11.34); 

(3) Maxwell’s equations in terms of the field tensor in two ways: (11.47), 
(11.49); 

(4) Maxwell’s equations in terms of the 4-potential: (11.57). 


We have framed each of these equations in a box to stress their 
importance. 


11.6. EM Field of a Charged Particle in Uniform Motion 


What we are doing in this section is partly a reflection of the statements 
we had made in Sec. 10.2. What we had done in that section using force 
transformation, will be done here for a very specific example, using field 
transformation. It is heartening to see that the force transformation and 
the field transformation, though they evolve from two entirely different 
offshoots of the special theory of relativity blend with each other, showing 
a larger homogeneity and consistency of the theory. 


11.6.1. Transformation from the rest frame to Lab frame 


The charge q is seated permanently at the origin of its rest frame S’. It is 
now viewed from the Lab frame S, moving in the negative X direction with 
velocity Gc, relative to S$’, so that the charge is moving in the positive X 
direction with respect to the Lab, as explained in Fig. 11.5. The origins of 
S and S’ coincide at t = t/ = 0. Let us set k = Ta The EM field in the 
rest frame is 


Is /s vi 
a = = cB! = 0, (11.59) 
where r’? = a’? + y? 4 2”. 

Let us now consider an event O having coordinates (x,y, z,ct) in S$ 
and (2’,y’,2’,ct’) in S”. Let the EM field at this event be (E,cB) in S$ 
and (E’,cB’) in S’. The Lorentz transformation of the coordinates from 
S’ to S is given by the formula (3.9), whereas the Lorentz transformation 
of the field components (at the event ©) from S’ to S is the inverse of the 
equations (11.37), i.e. with 8 replaced by —£. 
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Z Z’ 


Fig. 11.5. The rest frame S' of the point charge g moving with velocity c8 with respect 
to the Observer’s frame 9’. 


Now consider the following event Op = “the EM field due to the 
above point charge q is detected by a certain observer P” situated on the 
XY-plane, so that z = z’ = 0. Ignoring the z coordinate, we transform 
the field given in (11.59) to the frame 5S”, using (3.9) and (11.37). In this 
first conversion, the radial distance r’ transforms into R, which we write in 
its relation to S$ 


pr? =? (2 — Bet)? +y? +27 = R?. (11.60) 


The EM field (E,cB) in S transforms into the EM field (E’,cB’) in S’, 
whose components are given as 


x (a — Bet) - _ 
Ey, = Ey =k 3 =k R3 5 cB, = cB! =0, 

oy 399 
By = Ey =k 78 kay cBy = (cB, — BE,) = 0, 

Ve VF y 
E,= VE, 3 kas 0, cB, =7(cB, + BE,) = kyB ps 


(11.61) 


Let r be the radius vector stretching from the instantaneous location of 
charge q at the time ¢t to the field point P, as seen from the Lab frame S. 
Then 


r = (x — Bct)it+ yjt+ zk, 


(11.62) 
and note that: 6 x E = Gi x (E,i+ E,j) = BE,k. 
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Now the six components of the transformed field written in (11.61) can be 
written compactly as 


ATED R®’ 


cB = Bx E. (11.63b) 


(11.63a) 


Note that Eq. (11.63) can be obtained entirely from (10.9) of Sec. 10.2, if 
we write for the E’ field the expression given in (11.59). 


11.6.2. Pictorial interpretation of the fields 


To get a picture of the E field, we shall obtain two sets of plot, showing (a) 
its angular distribution around the moving charge q; (b) its time variation 
at any given observation point P. For each case, we shall use “Gnuplot” 
to create exact plots corresponding to three values of 3, namely 6 < 1,= 
0.4, 0.95 in the first case, and 8 = 0.4,0.8,0.95 in the second case. 


Angular distribution of the field around the moving charge 


We shall obtain a pictorial interpretation of the E field described by 
Eq. (11.63a). Even though we had taken the field point P on the X Y-plane 
for convenience, that restriction is withdrawn for writing the general expres- 
sion for the field in terms of the radius vectors. The E field is radial 
in both frames of reference S’ and S (i.e. emanating radially from the 
instantaneous location of the point charge). However, it is the isotropic 
Coulomb field in the rest frame S, whereas angle dependent in the Lab 
frame S’. 

To picture this we have set up “displaced” Cartesian axes XxX ,Y,Z of 
the Lab frame, the origin A of which coincides with the instantaneous 
location of the charge q at the instant t, shown in Fig. 11.6. We shall 
denote the “displaced” Cartesian coordinates as (%,y,z), and set up the 
spherical coordinates (r, 0, @), as illustrated in Fig. 11.6. It is obvious that 
x = x — Bct whereas the “displaced” y, z coordinates are the same as with 
respect to the original axes. 

The relations (1) between the (%,y,z) and (r,0,@), and (2) between 
the Cartesian unit vectors and the spherical unit vectors, are given by the 
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z 


Fig. 11.6. Displaced axes with its origin at the instantaneous location of the moving 
particle at time t. 


following formulas: 
y=rsiné cos¢d, e, = sin@(cos¢j + sin¢k) + cos Gi, 
z=rsin@ sind, eg =cosé(cos¢j+sin¢k) — sin Gi, (11.64) 
&=r cos8, eg = —singj+cos¢k. 

The (E, B) field of Eq. (11.63) can be written as 


ae 


E(r, 0,9) = ky er, 


(11.65a) 
cB(r, 0,0) = BEsin 6 eg. (11.65b) 


To get a make the picture complete, we need to express R in terms of 
r,@ and 6, beginning with Eq. (11.60), exploiting the Cartesian > spherical 
conversion formulas (11.64), and using the relation (11.60): 


Rae +y? +2? =7?r? (1-6? sin’ 6). (11.66) 
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Therefore, we can rewrite Eq. (11.65) as 


k 

E(r, 6, ¢) = ——_—-——— e,, (11.67a) 
yr? (1 — B? sin? ) - 

cB(r, 6, ¢) = nea (11.67b) 


—__________ ¢... 
yr? (1 — 8? sin? ) a 


If we write the magnitude of the E field along the Y Z-plane (0 = 7/2) 
as £,, and along the direction of motion (9 = 0) as Ey, then 


_ ky 


= — Ae 
Brae Bae PL/By = Lee) 


To get a clear picture of the angular distribution of the field, we shall 
plot E on a “unit sphere”, which we define as 


mr =Vk= q 3 
\ ATED 
1 


so that, from (11.67) E(ru,0,¢) = 


(11.69) 


—______—_x e,. 
1? (1 — 6? sin? 9)? 

Figure 11.7 shows plots of this field surrounding the moving point 
charge, on a unit sphere, corresponding to three values of 6, namely 
6 «1,8 = 0.8, 8 = 0.95. The E/E ratios for the above three val- 
ues of 8 are shown in Table 11.1. The length of the arrow in each figure is 
proportional to the strength of the field. We have plotted the field exactly 
(i.e. the lengths of the arrows exactly) using Gnuplot. 


Table 11.1. E/E for 3 values of f. 


B Y E/E 
<1 ~) 1 
0.8 1.66 4.63 


0.95 3.2 32.84 


Time variation at any given observation point P 


The observation point is taken on the Y-axis, at x = 0,y =1. Substituting 
these values in (11.63a), and setting oe = 1,c = 1 we get the z- and 
y-components of the E field (its z-component is zero). 


et 
[(7Bt)? + 1)3/2” 


 ——— —— (11.70) 


Ey (Be + ye 
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= 


*Edata—151123B.txt? using 1:2 + 
*Edata—151123B.txt’ using 1:2:1:2 


0 1 2 3 
*Edata—151123B.txt’ using 1:2 + 
*Edata—151123B.txt’ using 1:2:3:4 — 


Fig. 11.7. Electric field lines on unit sphere corresponding to 6 < 1,= 0.8, = 0.95. 
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Fig. 11.8. (a) and (b) Plots of the E,, Hz as functions of time, corresponding to 8 = 
0.4, 0.6, 0.8. (c) Ion A passing an atomic electron with velocity cf in its passage through 
matter. 


We have plotted these two fields as functions of t, and corresponding to 
8 = 0.4, 0.8, 0.95 in Figs. 11.8(a) and 11.8(b). 


11.6.3. Ionizing effect of a heavy ion in its 
passage through matter 


The formula (11.63a), and its time plot shown in Figs. 11.8(a) and 11.8(b) 
have important and interesting applications in nuclear physics. They are 
used to calculate the ionizing effect of a “heavy ion”, e.g. a proton (hydro- 
gen ion), a-particle (helium ion), as they pass through matter with high 
velocity, close to the speed of light, and the Range of such particles in 
matter (i.e. distance travelled before being stripped). 

We shall get an appreciation of the “unit radius” defined in (11.69) with 
a realistic example, say a “heavy charged particle” A of positive electric 
charge ze (e.g. a proton with z = 1 or an a-particle for which z = 2, the 
corresponding “light particle” in this case is electron). As the ion moves 
through a piece of matter, e.g. gold foil, or an aluminium strip it ionizes 
the atoms, by pulling out electrons with its electric force. To make things 
simple we take the ionizing particle to be a proton, for which q = e = 
1.6 x 10—!9 C. Setting = = 9x 109N.m?/C?, we get ry = 3.8 x 10-° m, 
or, 38 microns. This “unit radius” is therefore quite large compared to the 
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radius of the ionized atoms. We have used it only for the convenience of 
plotting, without attaching any further significance to it. 

The phenomenon we are talking about is a collision of A with e~ (even 
though they do not physically touch each other), depicted in Fig. 11.8(c). 
The particle A of charge ze moves along the X-axis with velocity v = cf, 
encountering an atomic electron of charge —e, located at P, at an impact 
parameter b. Being heavy (infinitely heavy by assumption), it moves along 
without being deflected from its straight path. As it zips past by the atom it 
exerts a short burst of electric force F, on all electrons in the atom, lasting 
for a very short time At, compared to the time period of the electronic orbits 
(pre-quantum classical picture). In this short burst, the movement of the 
electron is so small that it can be treated to be stationary. (Obviously, the 
magnetic force F,,, will have practically no effect.) As a result, some of 
the electrons in the atom will receive enough energy to be ejected out of 
the atom, thereby ionizing it, while some others will be excited to higher 
energy levels. 

Let us get back to some calculations. It is seen from the plots in Fig. 11.8 
that FE, is antisymmetric, hence no effect when integrated over the t-axis. 
In contrast Hy is symmetric. When integrated over the t-axis, it will give a 
net area under the curve, a4 + hot, where h = ae and 6t = 2t1/2 = mae the 
width of the curve at “half height”, standing for the effective “duration of 
the impact”. (Reader may verify the estimate.) Then @ ~ ma The passage 
of the heavy particle gives a sharp impulse, i.e. a momentum transfer dp = 
ze" 2 


cern oe the negative Z-direction (net pulling effect of the traversing 


particles). The corresponding energy transfer is 


ea = 


eS ae (11.71) 
2me = (AE)? Mev?b? 
The above formula has important application in nuclear physics [36-38]. 
A charged particle, like a-ray, while passing through matter, keeps losing 
its kinetic energy by transferring it to atomic electrons, thereby liberating 
them, and producing electron-ion pairs, and ultimately getting stopped 
after traversing an average distance R, called its range in the given material. 
For example, the 5.4 MeV a-particle [38] emitted by the polonium isotope 
P?!°. carrying energy 5.3 MeV, has a range of about 10 cm in dry air. As it 
passes through, the material it keeps losing energy, by producing electron— 
ion pairs, which in this case is about 6600 pairs per mm. 
We shall not pursue this matter further. For an exact calculation of 
these effects and an estimation of the Range of the particle, the interested 
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reader can look up standard books in Nuclear Physics as cited in the 
footnote. 


11.7. Exercises for the Reader III 


Problem 11.1. The relative nature of the electric field E and the mag- 
netic field B can be seen in an elementary manner as follows. Let a par- 
ticle of electric charge gq move along the X-axis with the velocity cG, as 
seen from a Lab frame S,, with respect to which there is a uniform mag- 
netic field B = Bj and no E field at all. According to the Lorentz force 
equation (11.30), the particle experiences a magnetic force F = qcG Bk. 
With respect to the rest frame S’ of the particle, however the particle is 
at rest, and hence, there is no magnetic force. The same magnetic force 
therefore now appears as an electric force. Therefore, there is an electric 
field equal to E = cG Bk, in the Z-direction. 

The answers suggested above are only approximate in view of the fact 
that we did not apply the correct force transformation equation (8.55) to 
obtain the force in S’. (a) Apply the suggested correction to obtain the E 
field in $”. (b) Check your answer using the LT equation (11.37) for the 
electromagnetic field under the boost:$'(c8,0,0)S’. (c) Does the absence of 
“magnetic experience” by the particle in the rest frame imply vanishing B 
field in S’? Check your answer using (11.37) again. 


Problem 11.2. The conclusions of Problem 11.2 raise many paradoxes. 
A “pure” magnetic field in S' can be produced by a “neutral” wire carrying 
electric current, as is usually the case for a d.c. circuit. How can such a 
wire cause an electric field in another frame of reference? 

To solve this paradox, consider a line current J flowing through a 
straight wire along the X-direction in a Lorentz frame S. This current 
can be considered to be due to a stream of negative charges (namely, the 
electrons) of charge density pneg = —Po with a drift velocity —c along 
the X-axis, overlying a stationary distribution of positive charges (namely, 
positive ions at the lattice points of the conductor) of equal and opposite 
charge density ppos = Po. Assuming that the wire has a circular cross- 
section of radius a, the electric current density and the charge density p for 
this configuration are given as follows: 


; PoC,r <a ; 
je = > Jy = Jz =0; P = Pneg + Ppos = 0. (11.72) 
0, r>a 
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This configuration produces only B field, but no E field. 

(a) Find the B field at the point P(0,y,0) on the Y-axis produced by the 
above current. 

Hint: The (E, B) fields at P due to a line charge density \ and a line current 
density I, both along the X-axis, are given as 


I 
A Boa" Ee (11.73) 


- 2TEO Y . 2TE9 CY 


(b) Using (11.37) show that the EM field at P, as seen by an observer S’ 
who is moving along the X-axis with velocity c, is 


_ wb poa? | 


2 
VPoG 
: peasy =f eno 
2E0y ey 2 


2eoy 
(11.74) 


El =E=0,E,= 


(c) Using LT of the 4-current density shown in (11.72) show that the electric 
current density and charge density with respect to S’ are 


+ Poyeu, r Sa y + / / / 
de> 0 ss ; Jy Sg 0; po= Preg + Ppos = — WB Po- 
3 Hi a 


(11.75) 


(d) With the help of (11.73), and using the current and charge densities 
shown in (11.75) obtain the (E,B) fields at P as measured by S’. Verify 
that you get the same answer as in (11.74). 


Moral: A wire which is neutral with respect to a Lorentz frame S is not 
neutral with respect to another Lorentz frame S’, which is why there is an 
E field in S’, in addition to the B field due to the electric current. 


Problem 11.3. We saw in the last two problems that a pure magnetic field 
in one Lorentz frame will appear as a combination of magnetic field and 
electric field in another. We shall examine the reverse of this case in this 
problem. Consider a frame S in which the field is purely E, there being 
no B-component. The force on a charge g moving with velocity v = cv is 
therefore F = gE. 

(a) Using the force and velocity transformation formulas (8.55) and (4.23) 
(and their inverses if necessary), show that the force components in the 
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frame $’, under the boost: S(c8,0,0)S’ becomes 
Fi = q|Ex — By(uj Ey + v4 E2)], 
Fy = ay(1t Buz) By, (11.76) 
FL = qy(1t Buz) Ez. 


(b) Hence, show with the help of (11.30) that there are both E and B fields 
in S’, given as 


Ei, = Ex, E;, = yEy, Ey, = YEz, 
z (11.77) 
cB, =0, cB, = yBEz, cB, = —yBEy. 


(c) Check Eq. (11.77) against the general field transformation 
equation (11.37). 


Part IV 


4-Miomentum Conservation 
in Continuous Media 


This page intentionally left blank 


Chapter 12 


The Energy Tensor 


12.1. Why Energy Tensor? 


Forces of gravity originate from massive objects like the earth, the Sun, 
the stars — as Newton’s theory of Universal Gravitation tells us. Mass 
loses the pristine purity enjoyed in Newtonian mechanics, because the rel- 
ativistic mass is no longer constant (being a function of velocity), and the 
rest mass is no longer conserved. What replaces mass is energy, thanks to 
E€ =mc’. Energy, in turn, is the time component of a larger entity, namely 
4-momentum (or, En-Mentum, a term coined in Sec. 8.4). 

For an analytical study of gravitational field in Newtonian formalism, 
its source is expressed in terms of mass density. That density now needs to 
be replaced by some 4-momentum density. However, density being some- 
thing per unit volume — and a unit volume in one frame being not so 
unit in another — the search for an appropriate density leads towards 
a 4-tensor which is generally known as energy-momentum stress tensor. 
We would however prefer to give it a shorter name — energy tensor. 
Einstein’s General Theory of Relativity traces the source of gravitation 
to this energy tensor. There is another reason for knowing the energy ten- 
sor. Even within the ambit of special relativity, it is a legitimate urge to 
write the conservation equations for energy and momentum in the appro- 
priate language, namely covariantly. One would stumble upon the energy 
tensor in this attempt to write covariant conservation equations, which 
is also a prerequisite to the study of relativistic quantum mechanics and 
quantum field theory. The concept of energy tensor is built upon a more 
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elementary three-dimensional base namely, the stress tensor. We had pre- 
sented a detailed exposition of stress tensor in Sec. 5.2, and of Maxwell’s 
stress tensor in particular in Chapter 6. The volume force density f,(r) in 
matter (solid or fluid) at a point r was shown to be the divergence of the 
stress tensor field T(r), in Eq. (5.79), which we are rewriting here: 


f,(r) =V- T(r). (12.1) 


12.2. Minkowski Volume Force Density 


Passing from Euclidean space to Minkowski space-time, one might expect 
the four-dimensional generalization of stress tensor to be a 4-tensor field 
whose 4-divergence would yield Minkowski volume force density. However, 
density functions do not exhibit well-defined transformation property unless 
volume is measured in the comoving frame. We had touched on this aspect 
in Sec. 11.3. We transplant Fig. 11.4 from that section here and relabel it 
as Fig. 12.1. It shows a stream of particles constituting a fluid in motion. 
An infinitesimal volume dV (shown coloured in the figure), identified at 
the event point (a) = (ct,r), contains a collection of fluid particles, which 
together possess a rest mass dm,, a quantity of charge éq, and is moving 


af 


(XY; 


ZT | 


Fig. 12.1. Lab frame S and comoving frame Sp in a streamline flow of particles. 


The Energy Tensor 321 


with the velocity u(r, t) with respect to the Lab frame S. The correspond- 
ing 4-velocity U“ and the Lorentz factor I(x), copied from Eqs. (11.24) and 
(11.20) are 


U" (x) =T(a2)(c, u(x)) = P(e, wu, ua, us), (12.2a) 
(a2) = —— (12.2b) 
(eae) 


Lorentz contraction of the dimension of this box along the direction of 
u, changes its proper volume dV, to the laboratory volume 
1 
T(x) 
Let the Minkowski force acting on these particles (inside the proper 
volume dV,) be 5F (x) =) OF" (x), the corresponding 3-force 6F(x), and 
the power received dII(z). Then from (8.31) 


5V = 5Vo, or, 6Vo =T(ax)dV. (12.3) 


= 6V, (a. oe (12.4) 


We define Minkowski volume 4-force density F (c) to be the Minkowski 
force per unit proper volume — the 3-scalar density @ (to be pronounced 
as var-pi) as the power received per unit lab volume, or power density 
and 3-vector density f(a) as the 3-force per unit lab volume, as explained 
below: 


F (2) = vim, zac) (12.5a) 
a(r) = iim, | (12.5b) 
f(x) = im, ss (12.5c) 
It follows from (12.4) and (12.5) that 
SF ia) Say, Fie), (12.6a) 


where FOS (=, f(x)) (12.6b) 
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An example of Minkowski volume 4-force density F (2) and its time and 
space components can be seen in Eq. (12.20a). 

At this point, we shall remind the reader of the convention we adopted 
in Sec. 8.1 and onwards. A 4-vector A can also be written in terms of its 
4-components {A“; (4 = 0,1,2,3)} as A = e, AM. A 4-tensor can also 
be written in terms of its 4 x 4 components {T”; (u,v = 0,1,2,3)} as 


aT a Tee . Sometimes we shall ctl to a 4-vector or a 4-tensor as a 


whole ‘ nee object” like K and T. Sometimes we shall write the 


same quantities in terms of its components as A“ and TH”. = 


The 4-stress tensor is now defined to be a symmetric tensor: T(x) = 
e, T"” (x) @p of rank 2, satisfying the requirement: 


(12.7) 


We may as well call it Minkowski 4-stress tensor. 

Note from (7.113) the time and space components of the operator are 
given by V- (22, Vv). 

In the next few sections, we shall work towards such a tensor for a stream 
of incoherent dust, consisting of electrically charged particles, subjected to 


only electromagnetic forces. 


12.3. Energy and Momentum Conservation in One Voice 


Before proceeding further we shall apply Lorentz force (11.30) to a volume 
distribution of electric charges. Write the Lorentz force dF on the charge 
content 6q inside an infinitesimal volume 6V, by replacing q with poV, 
where p is the charge density as defined in Sec. 11.3. Now divide each side 
with dV, and use the definition of volume current density J, as defined in 
(11.19). The Lorentz force density at the event point (a) is now 


fan= im, 7 = =pE+JxB. (12.8) 
There are two important theorems that are used to state the conserva- 
tion laws involving the electromagnetic forces. We shall state them as two 
theorems, because they follow directly from Maxwell’s equations. 
Consider the same stream of charged particles subjected to electromag- 
netic forces of their own creation. We shall apply energy and momentum 
conservation theorems to this system of particles. 
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(A) The energy theorem, also called Poynting’s theorem* is written as 
follows: 


E. 3+ %=-v-8, (12.9) 


We have proved the above theorem in Appendix A.1. 
We interpret the terms appearing in the above equation as follows: 


E- J = work done by the field on the fluid particles per unit volume, 


= rate of change of kinetic energy per unit volume, (12.10a) 


w= <(E + cB?) = field energy density, (12.10b) 


S = egc?(E x B) = field energy flux density = Poynting’s vector. 
(12.10c) 
All densities alluded to in the context of energy-momentum theorems 
(12.9) and (12.13) are lab densities. See comments after Eq. (12.4). 


To justify the above interpretation we integrate over a volume V 
bounded by a surface S', and applying Gauss’s theorem, we get 


i. (z. 54) w= IIf, V -Sdv 


--f S-da. (12.11) 
Ss 

LHS = rate of change of [mch energy + fld energy] inside V. 

RHS = — outflux of fld energy across S = influx of fld energy across S. 
Therefore, 


rate of change of [mch energy + fid energy] per unit volume = influx density 
of fld energy per unit volume. 
Our interpretation is justified. 


(B) The momentum theorem: 
We proved the following theorem, which follows from Maxwell’s equa- 
tions, as Eq. (6.50c). 


(pE+J x B)+ {2 (e9E x B)} =V-F em): (12.12) 


aSee [13, pp. 258-261]. 
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We shall interpret the two terms on the LHS as follows. The (E, B) 
field exerts a force on the existing charge-current distribution according 
the Lorentz force equation. The first term represents this force fem, as in 
Eq. (12.8), equal to the rate of change of the momentum of the particles 
per unit volume represented by P, which we shall refer to as mechanical 
momentum density. 

However, when these fields start changing with time they create a prop- 
agating em field which carries away energy and momentum. The second 
term should represent the rate of change of this field momentum density, 
to be represented by the symbol g: 


e Ss 
9g eh €9(E x B) =_ a (12.13) 
Now we can rewrite Eq. (12.12) as 
OP Og x 
atime Vv: : 12.14 
ap ee ee (12.14) 


To justify the above interpretation, we shall integrate (12.14) over a 
volume V bounded by a surface S, and applying Gauss’s theorem, we get: 


a (Uff,2e) va Mle) Ill, (0 Fo) 
= | | _Ttem mr) da. 


(12.15) 


LHS = The rate of change of [Mch momentum + Fld momentum] inside V. 
RHS = Total em force transmitted across S = Influx of fld momentum 
across S. 

Hence, we interpret Eq. (12.15) as saying that 

Rate of change of [Mch momentum + Fld momentum] per unit volume = 
Influx density of Fld momentum per unit volume. 

Our interpretation is justified. 


Equations (12.9) and (12.14) are two equations expressing conservation 
of energy and momentum, separately. The spirit of relativity will demand 
that they should be integrated into a single equation, unifying conservation 
of energy and momentum as a conservation of En-Mentum (a name for 
4-momentum coined in Sec. 8.4). As a first step towards this we rewrite 
Eqs. (12.9) and (12.14) in such a way that the left-hand side will represent 
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the charged particles and the right-hand side the em field: 


p3=-|%iv-s|, (12.16a) 
pE+JxB= -3 +V + Tem) (12.16b) 


In the last equation, B (om) is the momentum “outflux density”, equal and 
opposite to momentum “influx density” T ey: See Eq. (6.53). 

Equations (12.16a) and (12.16c) represent the time component and the 
space components of one 4-vector equality. 

The right side terms can be combined into a 4-vector, which we shall 


define to be the negative 4-divergence of a 4-tensor M, namely the Mazwell’s 
energy 4-tensor. This tensor is an upgradation of the Maxwell’s stress 
T (om) defined in Eq. (6.46), except that (—T (em))s defined as Boom); forms 
the 3 x 3 core of this upgradation. The 4 x 4 components of this tensor will 
be written as M"”. The time and space components of the new 4-vector 
are as follows: 


1 e 
Time component: — G +V- s) 7M, (12.17a) 
c 
Og > def awk 
Space component: BE +V-®oem)| = Vai; k= 1, 2,3. 
k 


(12.17b) 


The subscript & on the left-hand side implies x,y,z components of the 
vector corresponding to k = 1, 2,3, respectively. 

It is now easy to identify the 16 components of the Maxwell’s Energy 
4-tensor M by taking a close look at Eq. (12.17), and recalling the com- 
ponents of the operator Vq shown in (7.113). Equation (12.17a) yields 
the components of the column 0, and Eq. (12.17b) the components of the 
columns k = 1,2,3. Remember that cg = S/c, according to (12.13). For 
further help, see Appendix A.3. Moreover, 

0 1 2 3 

w  Sz/c Sy/e Sz/c 
See Oey Pan am (12.18) 
Syfe Of, 02%, O28 | 


S,/¢ OF, Gon Gin 


M""’ (2) = 


wnro 
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We have written ®11,,6)2,... to mean 627, 67% ..., respectively. Note 
that M"” is symmetric and traceless. 
Me“ = MY’ 
, 12.19 
M*, = 0. ( ) 


Both properties are Lorentz invariant, i.e. same in all inertial frames. 

The left side terms of (12.16) can be combined into another 4-vector, 
namely Fhe: the Minkwski force per unit proper volume from the em 
field on the charged particles in the fluid, thereby changing the En-Mentum 
of the particles of the charged fluid: 


1 
Ftasmat(2) = (= J, pE+J x B) (12.20a) 
= 1 [La 
= Gp Fo, (12.20b) 


where F"® and Ja are, respectively, the electromagnetic field 4-tensor and 
the electric current density 4-vector, both of them defined in Chapter 11, 
as Eqs. (11.36) and (11.25) respectively. 

Equations (12.20a) and (12.20b) are analogous to Eqs. (11.32) and 
(11.34) of Chapter 11. In fact (b) follows from (12.20a) in the same way as 
Eq. (11.25) follows from Eq. (11.32). 

The conservation equations for 4-momentum, appearing disjointedly as 
(12.9) and (12.14), will now join into the following single 4-equation?: 


1 
< FHO J, = —Vg MP*(2:). (12.21) 
Cc 


12.4. Euler’s (Non-relativistic) Equation of Motion 
for a Perfect Fluid 


Our objective now is to construct the energy tensor of the simplest “closed 
system”. The term “closed” in this context means that the system is self- 
contained in all its dynamical behaviour, i.e., all dynamical processes take 
place due to forces of interaction within the system, there being no scope for 
exchange of energy and momentum with anything outside. The total energy 
and the total momentum of a closed system are therefore fully conserved. 


bSee [33, p. 152]. 
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A closed system contains both matter and forces. The only kind of 
classical forces that can receive relativistic treatment are electromagnetic 
forces. Before linking up matter with electromagnetic forces, we shall con- 
sider an oversimplified model which consists of matter in the form of perfect 
fluid — sometimes also called “classical fluid” — moving under the influ- 
ence of internal and external forces whose origin we need not specify at 
this moment. We shall first lend a non-relativistic treatment to this fluid, 
so that transition to a relativistic formalism becomes smooth in the next 
section. The equation of motion of this perfect fluid is known as Euler’s 
equation. 

By perfect fluid, we mean a fluid which does not offer any viscous forces, 
which as the reader knows, causes shear stresses in the fluid. A perfect fluid, 
whether at rest or in motion, can sustain only normal compressive stresses 
inside, familiarly known as “pressure” . 

Let us consider a fluid in streamline motion as previously illustrated 
in Fig. 12.1. This fluid is characterized by a velocity field u(r,t) and a 
fluid mass density o(r,t), both of which, in general, are unsteady fields, i.e. 
functions of t as well. The divergence of u is called dilatation, a term we 
shall explain with the help of Figs. 12.2(a) and 12.2(b). 


Fig. 12.2. Explaining fluid motion. 
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We have shown a stream of fluid in motion, inside of which we have 
marked out a volume V at time t. Since the fluid particles on the surface S 
of V have different velocities u(r, ¢), the boundary S not only moves with 
the particles lying on it, but also changes to a different shape 5S” (shown 
with broken line) at the time t + dt. Consequently, V will also change to a 
different volume, say V’. 

Consider a film of fluid particles lying over a tiny area da centred at 
the point P. These particles move a tiny distance udt from P to P’ in time 
dt. In this time, a volume of fluid 6v flows out from V, crossing the tiny 
surface area da. The volume that flows out is 6v = [u- njdadt. 

There are certain regions of S, say at P, where u-n is positive, and the 
outflux (i.e. volume outflow) is positive. There are some other regions, say, 
at Q, where u-n is negative, and the outflux is negative. The net outflux 
of fluid volume is the surface integral of u over the boundary surface S. 
This can be written as 


dV =V'-V= [| oneal dt = ihsau dt, (12.22) 


where we have used Gauss’s theorem to convert the surface integral to a 
volume integral. We reduce the finite volume V to an infinitesimal volume 
dV, thereby avoid integration, and get 


d(6V) = [(V - u) oV] dt. (12.23) 
Therefore 
1 d(sV) _ 
3d V-u. (12.24) 


In other words, V - u is the rate of change of volume per unit volume — 
or, more compactly dilatation. 

Now we take up equation of motion properly. Consider a fluid ele- 
ment consisting of an infinitesimal collection of fluid particles moving 
along the stream (Fig. 12.2(c)). At the time ¢ its centre of mass is 
located at P where it occupies a volume dV. The mass of this element is 
dm = o(r,t)oV, its momentum dp = dmu((r,t)) and the force impressed 
on it OF = f(r,t)dV, where f(r,t) represents the volume force density. 


“See [34, p.136.] 
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Applying Newton’s second law of motion to this fluid element, 


< (op) = OF, (12.25a) 


d 
or, [6m u(r, t)] = f(r, t) dV. (12.25b) 


Note that in the above equation os represents convective derivative 
whose meaning we shall explain with a more general example. Let there 
exist a certain field f(x,y, z,t) in the fluid (e.g., temperature, fluid veloc- 
ity, pressure). The value of this field at the location of the particle changes 
from f(x,y,z,t) to f(a + dx,y + dy, z + dz,t + dt) as the particle moves 
with velocity u from the location r = (x,y,z) at the time t to take up a 
new location r + dr = (a + dzx,y + dy, z + dz) at the time t + dt. The net 


change is 


Of Of OF Pee 
oa op Oe OE 


= [2 af of | ar 


Ug + ——Uy + =u, + dt 


Ox Oy Oz Ot 


= lw. V+ 5 | f(x,y, 2, t)dt 


= (F) we (12.26) 


We have attached a subscript “c” to stress that the time rates of the changes 
of physical quantities in motion are given by their Convective Derivatives. 


d e a) 
# = (F). = (a V+ =) f(x,y, z.t). (12.27) 


Using Eqs. (12.24) and (12.27), we establish a few relations for future ref- 
erence: 


“0 5] = cov +00 
2 {zs (u- Vo Liat. u}] av (12.28a) 
= - Lv. (ou)| BV, (12.28b) 


where o represents any density function, of which the mass density is par- 
ticular example. 


330 Special Relativity, Tensors, and Energy Tensor 


In N.R. physics mass is conserved. Consider the two terms in the first 
equality in (12.28a). The first term, if positive, means increase in mass in 
dV due to density fluctuation. The second term, if positive, means increase 
in mass due to volume fluctuation. However, both of them cannot be pos- 
itive. Increase in one term is nullified by decrease in the other. Together 
they represent zero change. We get back the mass conservation equation, 
known as continuity equation. 


d Oo 
ale bv] =0 => BE +V-(cu) =0. (12.29) 


We shall convert Eq. (12.28) to a momentum equation. Replace the 
scalar density o with the density of the z-component of momentum cu, in 
the above equation, and get 


<((ous) éV] = ae +V- (ousu)| av. (12.30) 


The above relation holds for all the three components uz, uy, uz. Multiply- 
ing the components with e;,e,,e, and adding them together, we get 


“ou bV] = low +V- (ouu)| oV (12.31) 


Going back to Eq. (12.25), noting that 6m u(r,t) = ou dV and using (12.31) 
we get the general equation of motion for the fluid: 


0 
Bt ow) +V-(cuu) =f. (12.32) 


Equation (12.32) is the general equation of motion of a fluid, to be referred 
to as the Euler’s equation. 


12.5. Relativistic Equation of Motion for a 
Continuous Incoherent Media 


We shall upgrade the E? version of the fluid equation of motion (12.32) to 
M*. The starting point of the former was (12.25a). The starting point of 
the latter will be the M+ version of this equation, i.e. Eq. (8.28) in which 
we set B+ 5P and F — 5F: 


a sii, (12.33) 


Here 5p is the 4-momentum of the mass content of the same fluid volume 
dV considered in Sec. 12.4, and 6F is the Minkowski force on this volume. 
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We shall write the left and the right side of the above equation 


in 
sp = @/, Op"; so that ate = é} () ; (12.34a) 
© 


dt 
5F = f (x) 5Vo = Sf (f¥(x) 6Vo). (12.34b) 
Then the EoM (12.33) can be written in the form: 
déph _ i 
P= f(x) Vo, (12.35a) 
ddpt 
or °P = f(x) ov. (12.35b) 


To go from (12.35a) to (12.35b), we divided each side with T, and recalled 
Eqs. (4.16) and (12.3). 

If we make use of the expression for Ff given in (12.6), the EoM takes 
the form: 


(12.36) 


Equation (12.36) will be the backbone of our arguments. It states that 
the rate of change of 4-momentum of an infinitesimal volume dV of particles 
at the event point (x) is equal to the total 4-force acting on this volume. 
We shall convert it to a beautiful form in Eq. (12.44) 

At this point let us be aware that mass is not conserved in relativistic 
mechanics. Mass conservation is violated, even if infinitesimally, in all real 
situations. Mass of a system changes when chemical reactions take place, 
when atoms absorb or emit light, when a gas expands or is compressed. 
Even for the perfect fluid, whose dynamics was given a relatively simple 
non-relativistic treatment in Sec. 12.4, its mass is continuously changing 
because of the work being done by fluid pressure. This effect has to be 
taken into consideration. 

To make our task manageable, we shall think of a system of particles 
forming a tenuous fluid in motion. The constituent particles move along 
streamlines without a bond between them, i.e. no collision occurs. Also, 
the constituent particles — atoms, molecules, nuclei, electrons — whatever 
they may be, remain in their original ground states through the dynamical 
processes, and, hence, do not emit or absorb light, so that their rest masses 
do not change. The particles are charged, and the electromagnetic field 
created by their charges determine their equation of motion. 
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Let Fig. 12.1 represent a segment of this flowing fluid. An infinitesimal 
volume 6V of this fluid, at the event point (a), possesses a rest mass 0mo, 
which is the sum of the rest masses of all the constituent particles inside dV. 
That is, 6mo = ye Moi, Where ON is the number of particles inside 6V 
and m,; is the rest mass of the ith particle in this infinitesimal collection. 
Let o, stand for proper density of rest mass, which we define as 


dmo 


= ——. 12: 
5V,-40 bVe” at) 


do(x) 
where dV, is the proper volume of the above collection, i.e. volume mea- 
sured in the instantaneous rest frame. In contrast to o,, we use another 
symbol o to denote density of relativistic mass in the observer’s frame S. 
Seen from the observer’s frame, the above collection of 6N particles are now 
confined within a smaller volume 6V = 6V,/T and the relativistic mass of 
this collection is 6m =Tdmpo. Therefore, 

om Tl’ d6mo 

= lim == lm =, =!I"o,(z). 12.38 
72) = Mo BV ~ ato Vefr) 7 2.88) 

We shall work out the equations of motion of the energy and momentum 
content of the volume dV. The relativistic mass of this volume is 


dm = o(a)oV. (12.39) 


Therefore, according to formulas (8.19) and (8.18(b), 8.18(c)) of Sec. 8.4, 
the 4-momentum of the mass content within this volume is dp“ = (dp°, dp) 
where 


dp” = bme = (a bV)e, 
dp = bmu= (a 6V)u. (12.40) 


Let us now go back to the equation of motion (12.36). We shall expand the 
left-hand side corresponding to 4 = 0, using the time component of dp” as 
given in (12.40). With some help from (12.28): 


Oo 


OE oe ehaa Ee ae; (ou)| iV, (12.41) 


dt “dt ot 
and corresponding to pp = 7 = 1,2,3 in a similar way with help from (12.31): 


ap = “(ou 5V) = — LY (ouu)| iV. (12.42) 
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We shall combine the above two equations to obtain the following 


identity: 


dp! ({ 2 


(12.43) 
We have used Eq. (8.7) which gives the 4-velocity U” = T(c,u), in which 
u is the velocity of a particle at the event point (2). 
Going back to (12.36) we rewrite the same EoM in the compact form: 


Valoo(UPU*)] = f*, (12.44) 


which is the EoM for a system of incoherent dust subjected to 4-force 
= A i per unit proper volume. 
Much of our discussions to follow will be based on Eq. (12.44). 
The compact four-dimensional EoM (12.44) has the following (time— 
space) components: 


(co) +V- (cou) — +V- (oun)} = (=, r) _ (12.45) 


The time part is the energy equation, and the space part the momentum 
equation. 

We now define the energy tensor of the incoherent fluid (also called 
incoherent dust) as* 


Dev 2g UHUY, (12.46) 


It is now very easy to identify the 4 x 4 components of D#”: 


0 1 2 3 
c CUg: CUy CUz 


DY’ = o,T? x lig? U2 Ugly Ugly : (12.47) 


2 
UyC UyUs uy UyUz 


Uz UzUg, UzUy uz 


wnr oo 


4See [34, p. 141]. 
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The EoM, written as (12.44), takes the beautiful comprehensive form: 
VD" =f”. (12.48) 


12.6. Energy Tensor for a System of Charged 
Incoherent Fluid 


We prepared the groundwork for this section in Sec. 12.3, in particular 
through Eq. (12.21). Before proceeding further we shall recognize the fol- 
lowing two volume 4-force densities. 


0 
7 ase = (faa—mat F td sinat) (12.49a) 
= 4-force per unit proper volume from the em fid 


on the particles in the dust, 


cae — caer Fats) (12.49b) 


= 4-force per unit proper volume from the particles in the dust 


on the em fid. 
Let us now understand the effect of above two 4-force densities. 


Rate of change of matter 4-momentum per u.p.v. at(a) 


1 1 
= [= B-J,pB+IxB| = — FeO, 
Cc c 


= Ff narmas(2); (12.50a) 


Rate of change of field 4-momentum per u.p.v. at(x) 


1 (Ow Og > 
= | (e+v-s) F ($2 +: Bon)| = Veg M*(z) 


c 
= F mat+na(2) (12.50b) 


We can now go back to (12.21) and rewrite the 4-Momentum conserva- 
tion equation as°®: 


FT sae = ao ih (12.51) 


The above equation represents a generalization of Newton’s third law of 
motion for the 3-forces of action and reaction to the 4-forces of action and 
reaction between a charged fluid media and its own electromagnetic field. 


°See [33, p. 152]. 
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The EoM of the charged dust is given by Eq. (12.48), in which the 
“force” f# is now the electromagnetic force on matter, ie. ffa mat = Sens 
as given in (12.20), exerted by the em field originating from the charge— 
current density J’ present in the matter itself. 

The reader may re-read the statement following Eq. (12.36) to clear 
a possible doubt. The quantity F nasmat(2) OV is the 4-force exerted on 
matter inside the elementary lab volume 6V, by the charge-current distri- 
bution residing on the rest of the volume V —6V, outside dV. In this sense, 
this 4-force is an “external” 4-force (not a “self force”) determining the fate 
of the matter inside dV. The quantity (V_~D"(«)) dV is the time rate of 
change of the 4-momentum of the particles in this volume, according to 
Eqs. (12.46) and (12.43). 

The EoM (12.48) is now written as 


VaD™ (2) = ee 


But, Figsass = sof ere =—-Va Me (a), (12.52) 
by (12.51) and (12.50b) : 


Hence, V_D°" (x) = —Vq MH (ar). 


The system consisting of the matter (represented by D““) and its own 
em fld (represented by M°) is now a closed system. Its energy tensor is 


T%# (x) = D(x) + M%(z), (12.53) 


dust 
satisfying 


Valk =0. (12.54) 
In Newton’s theory of gravitation a massive star, or a massive planet, is 
the source of gravitation. In Einstein’s General Theory of Relativity mass 
is replaced by energy. However, energy itself has no respectable status, 
because energy is the time component of 4-momentum. Hence, energy is 
replaced by 4-momentum, and energy density (analogous to mass den- 
sity) by the energy tensor, which is loosely the density of 4-momentum. 
Since a star is an isolated object, its energy tensor must have zero 
4-divergence. Equation (12.53) gives the simplest example of such an energy 
tensor, and Eq. (12.54) tells us the desirable property of such a source of 
gravitation. 
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12.7. Energy Tensor of a Closed System 


By the defining property (see first paragraph of Sec. 12.4), the energy and 
momentum of a closed system are strictly conserved. This conservation 
statement is succinctly expressed in the mathematical equation: 


Val (x) = 0, (12.55) 


where T°(a) is the energy tensor of the system at the event point (x). 
Equation (12.54) gives an example of such a tensor. 

Equation (12.55) represents four equations, of which the ~ = 0 com- 
ponent expresses energy conservation, and the uw = 1,2,3 components the 
momentum conservation, since energy divided by c, and the momentum 
3-components, together, constitute the 4-momentum p". 

Before we proceed towards the construction of the energy tensor of a 
perfect fluid (which we take up in the next section), Eq. (12.55), and its 
full import will help us in our mission. 

To see the energy part of (12.55) we set 4 = 0. Now multiplying either 
side with c and expanding we obtain: 


aT =. a(cT*®) 
es, + i 
Ot Oxk 


=0. (12.56) 


This becomes the energy conservation equation, conforming to the format 

of the continuity equation shown way back in Eq. (11.14), by identifying: 
Sk 

, ee (12.57) 


c 


TT = w 


where w stands for the energy density, and the 3-vector S for the energy 
flux density. Equation (12.56) can now be rewritten in the standard form 
of the continuity equation: 

Ow 

—+V-S=0. 12.58 

aE (12.58) 

In a similar manner we set ps = 1,2,3 to view the momentum conserva- 

tion part of the equation (12.55), which yields: 
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As before, the above equation becomes the conservation equation for the 
ith component of momentum by identifying: 


T(x) =cg'(x); T(x) = kth component of T'(z), (12.60) 


where g(x) = (g!(x), g?(x), g3(x)) stands for the momentum density vector, 
and T’(x) = (T(x), T?"(x), T**(x)) for the flux density vector for the ith 
component of momentum. Hence, we can rewrite Eq. (12.59) as 


Og' 

Ot 

Note that T*’, which stands for the kth Cartesian component of the 

momentum flux density vector T’, is the (ki)-element of the energy tensor 

T"’ (ax). If we multiply either side of the above equation with e; and sum 

over i, and make use of Eq. (5.68), the momentum conservation is expressed 
compactly in the form: 


+V-T'=0, i=1,2,3. (12.61) 


Og a 
—~+V-T=0, 12.62 
at (12.62) 
where T is the momentum flux density 3-tensor, and comprises the space— 
space components of the energy tensor. 
The energy tensor is strictly symmetric, i.e. 


T! (2) = T(x). (12.63) 


The symmetricity of the space-space components is related to the 
conservation of angular momentum (see [34], p. 170). The symmetric- 
ity between the space-time, and the time-space components is hidden in 
(12.57) and (12.60), as we shall now show. 

The mass energy equivalence EF = mc 
density w(a) an effective mass density o(a) such that 


? associates with every energy 


w(x) = o(x) c?. (12.64) 


The energy flux density S(a) suggests an equivalent transport density v(a) 
such that 


S(x) = w(x) v(2). (12.65) 


In the case of the electromagnetic field, this transport velocity is the 
same as the velocity of light. In the case of a perfect fluid in which pressure 
exists, it is however different from the velocity u(«) of the fluid, as we shall 
see in the next section. 
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Also, momentum p and the effective mass m are connected by the rela- 
tion p = mv. This is true for relativistic and non-relativistic systems. It 
is obviously valid for a particle if we take m as the relativistic mass. It 
holds for a photon if we interpret E/c? as the effective mass of the photon. 
Extending the above mass-momentum relationship to the corresponding 
densities, we get g = ov = Sv. Use of (12.65) now leads to the following 
relationship between g and S: 


ge 58(2), (12.66) 


It now follows from (12.57) and (12.60) that T°* = T*°, 
To summarize: 


Te = T** = (ki), (ik) 


= component of the momentum flux density tensor T. (12.67) 


12.8. Energy Tensor of a Perfect Fluid 


We shall take up another simplified model of a closed system to shed further 
light on the meaning of energy tensor. We shall consider a gas, as in the 
previous section, but drop the assumption of tenuousness, so that the gas 
particles can be imagined to interact through collision, thereby giving rise 
to fluid pressure. 

When a fluid is compressed, its energy increases. Therefore, the fluid 
under pressure has extra elastic energy which must show up as extra mass 
density. In order to keep the discussion simple, we shall assume (a) that 
the fluid is perfect, so that the stress field inside has the simplest possible 
form, as given by Eq. (5.81), and (b) the heat exchange can be ignored 
so that the dynamical processes can be treated as adiabatic. We shall 
present a simplified treatment for this case. For a rigorous treatment, with 
a proper analysis of the stress tensor inside the fluid, the reader may look up 
[34, Sec. 6.6]. 

As in Sec. 12.5, consider the change in the energy-momentum content 
of a given number 6N of particles confined within a variable volume OV, 
as these particles move in bulk as a stream. The rest mass of this volume 
is oo(x)dV,, whereas its relativistic mass in the observer’s frame (which 
includes the mass [?a,(x)dV as given in Eq. (12.38) plus some extra mass 
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generated in this volume due to the work done by the pressure force) is 
a(x)dV. 

As the fluid moves, the energy content of 6V changes due to the work 
done along its surface by the surface force. Consider an area element 6a 
ata point P on the surface, as in Fig. 12.2(a). The force on this area is 
—pn oa, where n is an outward normal. The surface element is moving with 
the velocity u under the pressure force. The work being done by this force 
is —pn-uda per unit time. The net work on the closed surface per unit 
time is the surface integral of this elementary work, which we shall convert 
into volume integral using Gauss’s theorem, to obtain the rate at which the 
energy of the volume V is increasing per unit time. 


a= [f -m-nde= [ff —v-(u)a'r (12.68) 


Taking the above volume V to be very small, equal to 6V, and dividing 
both sides of the equation by dV we get the rate of change of energy per unit 
volume per unit time (same as what we called power density in Sec. 12.2) as 


w= —V- (pu). (12.69) 
Interestingly, we can expand the above expression into two parts: 


mw, =—Vp-u, (12.70a) 
m2 =—p(V-u). (12.70b) 


Equation (12.70a) shows how the pressure volume force —V>p, as given by 
Eq. (5.82), alters its kinetic energy per unit volume per unit time, and 
Eq. (12.70b) shows how the expansion of the gas against the external pres- 
sure contributes an elastic energy (same as the potential energy) per unit 
volume per unit time. 

We now write the energy equation, noting that oc? is the energy density. 
The rate of change of energy inside the volume dV is equal to ~ dV. Hence, 
by (12.69) 


e“Ig dV] = wdV = —V:- (pu) OV. (12.71) 
By Eq. (12.28) 
d Oo 
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Hence, the time component of the EoM: 


O(c?) 


Equation (12.73) represents the 4 = 0 component of the fundamental con- 
servation equation (12.55) and is equivalent to (12.56), and then to (12.58). 
Comparing, we identify the T“° components of the energy tensor: 


T(x) = w(x) = ace’, (12.74a) 
T(x) = ~s*(0) - (o a =) uke. (12.74b) 


Before writing the momentum equation of motion, we shall need an 
expression for the momentum density g in the frame S. This follows 
straight from Eqs. (12.66) and (12.74b). 


P 
g(x) ==> = (< # 5) u. (12.75) 
The momentum content of the volume 6V is therefore, 
6 eo Pp 
pe ( aS 5) udv. (12.76) 


To obtain the rate of change of this momentum we shall use Eq. (12.31), 
replace o with (o + <): 


= 2 E {(o+ 4) ub+v-{(o+4) uu} | sv. (12.77) 


The only force acting on this fluid element, as already assumed, is the 
pressure force, namely, 


6F = —(Vp) ov. (12.78) 


The EoM for momentum is 


dop _ 


— =F. 12. 
Ti (12.79) 


Therefore, from (12.77)—(12.79) 


2 {(o+2)ub+v-{(o+4) m+ ip} <0. (12.80) 
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Equation (12.80) is to be identified with (12.59). Hence, the remaining 
components of the energy tensor: 


T (a) = ( + ) uke, (12.81a) 


Zs R|'s 


Ti(e) = ( oi ) uiuk + dip, (12.81b) 


The components of T“” shown in (12.74) and (12.81) do not present a 
covariant expression, because the quantities on the right-hand side are not 
4-vectors or 4-scalars. We shall correct this defect. 

Let us go back to the IRS S, in which the components of T"” form the 
following diagonal components, by setting u = O in (12.74) and (12.81): 


ac 0 0 


0 o O O 
T rest (X) 0 a D 0 (12.82) 
0 0 0 po 


where p, is the pressure in S,. According to Corollary #3 in Sec. 8.7.4, the 
pressure p is Lorentz invariant, 1.e. 


P= Do: (12.83) 


The relativistic mass density o(a) has to be expressed in terms of o,(). 
For this purpose, note that 


POY OO ie, (12.84) 
where © represents the Lorentz transformation matrix corresponding to 
the boost: {—-u = —@c} from the rest frame S, to the observer’s frame S. 


Replacing 6 by —8 in Eq. (3.20) of Sec. 3.2 we get the relevant components 
of the LT: 


O20: 08. =T ae (12.85) 


Substituting (12.85) in (12.84), and recognizing the T°° component from 
Eq. (12.74a) we get the following expression for o(): 


oc? =T*ooc? + T7(82 + eo, + 82)po 
= Toc? + 178709 


= Tope” + (1? —1)pp. 
Hence, 


(12.86) 
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Substituting (12.83) and (12.86) in (12.74) and (12.81), we transform T#” 
to the following form: 


00 _ 2 Po Po) 2 
T® = {I? (oo +) Bpo 


po — Te — 7? (0 " *) cut, (12.87) 
Cc 


T* —[? (0 + ) urue +90", 


It is now obvious from Eq. (12.87) that the energy tensor for a perfect 
fluid has the following compact covariant expression: 


Hy (2 +4 +) U"UY — pg”, (12.88) 
Cc 


where g'” is the metric tensor and U(x) is the 4-velocity field of the fluid. 


Note that we have dropped the subscript “,” under p, because p = po. 

Compare the energy tensor of the perfect fluid with that of the incoher- 
ent dust shown in (12.46), and note the difference between the two energy 
tensors: 


LTV 
THY = THY _ DY =p (= = i”) . (12.89) 


The extra mass—energy—momentum represented by 6T"” is entirely due to 
fluid pressure. 
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A.1. Energy Conservation in Electromagnetic Field 


We shall prove Poynting’s theorem as given in Eq. (12.9) using Maxwell’s 
equation (11.39). The electric current density appears in Eq. (11.39b). 


BJ =Beage{V x cB (rt) — S| 


cot 
However, E-V x B=B-VxE-V-(ExB) (an identity). 
E (r,t 
Hence, B-J = eye {B-V x B~V- (Bx B)} ~ pn Ht) 
OcB OE (r,t 
= e9cB : (-) = V ; eoc?(E x B) = eg E : ee) 
_ _& Oo 24 .2R2)_~. 
= aE + c*B°) —V-eoc(E x cB) 
Ow 
=—-.—-V°‘S. 
Ot (QED) 
A.2. Examples of Lowering and Raising an Index 
Example 1. 
ee 1 0 O 0 
Vi=v" y} 0 -1 0 
ee | ge) ie og a 0 
ve 0 0 O -l 
= (V°, -V1, -V?, -V3) (A.1) 


343, 
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1 0 O 0 
0 -1 O 
Al = gt” A, = -_ ‘ (a Al, A2, AS) 

0 0 O =I 

Ao 

_Al 

=| (A.2) 

— As 


Lowering or Raising = No change in the time component. Sign change in 
the space component. 


Example 2. Let 


foe f°? fo? f? 

7 5 ae fit f? 5 a 
“—- f20 fal p22 423 (A.3) 

fo f* Sian f= 


be a contravariant 4-tensor. We shall lower only the first index jz, then only 
the second index v, then both indices p, v. 


1 0 0 foe fo f° f? 
0 —1 0 10 11 12 13 
ie = Guak” = ‘i f f f 
0 0 —] 0 F270 ft f? f?? 
0 0 0 | f° fo f°? f? 


— f* — fet = foe f8 
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First index lowered = No change in row 0. Sign change in rows 1, 2, 3. 


f°? Fur f” fe 1 0 0 0 

oe ge _ fre po fP fs 0 =1 0 0 

es Jav = 20 21 22 23 0 0 -1 +O 
oan ake aaa 3 

f° f?! f°? f? 0 0 0 —] 


: a =f? — f° 
fie ssi fehl _f? —7is 
f?° — frt — f?? a ia 
f° — fit —f? —f 


Second index lowered = No change in col 0. Sign change in cols 1, 2, 3. 


0 0 0 foo fol __ 02 __f08 
=f 6 0 flo fll _ 12 _ £13 
p20 f2l £2223 
0 —-1 fo f8t p82 p83 


Foo —f" —f” —f? 
= 710 fu fv f 
= 20 fer f?? f?? 
a for f?? f? 


Fuv = Quek = 


ooo 
| 
ar 


Both indices lowered = No change in {00, kj, 7k} components. Sign change 
in {0k, k0} components. 


Example 3. Trace of the contravariant tensor fF” is defined as Ff’); sum 
over jt. Going back to (A.5), 


Tr{F} = FY = fo —(f + f?? + f°) 


= sum of the diagonal elements of F". (A.7) 
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A.3. Components of Maxwell’s Stress 3-Tensor and 
Maxwell’s 4-Tensor, and Their Traces 


Maxwell’s 3-tensor was written in a short form in Eq. (6.46). We shall now 
write down the 3 x 3 components of this tensor. The reader should verify 
them. 


Tae = 2|(B2 — BE? — 8?) + 2(B? — B? — B?)), 


9 £ 7] az y 
Tall = > ((E} - B} - £2) + 2(B} - B2 - BD), 
ZZ £0 
Tem = 5 [(Ez BoB) ee (8, = BL BI, (A.8) 


Tot =T28 = eo|EeE, + BeB,), 
Tee = Tad = eo[EyE, + &° ByB.), 
Toe = Te = eB Ee He BB zl. 


We can now write the trace of the Maxwell 3-tensor: 
es E 
Tr{Tom} = Tat + Ta + Ta = -2(E? +B). (A) 


Maxwell’s 4-tensor was defined by Eq. (12.17). We shall use the same equa- 
tion to identify all the components of M"” (x). 


1 
From (12.17a): = G ge s) = Vai, 
c \ ot 
or a - 2 (S;/¢) = ou ~ M?° (sum over j) 
cot Oni 7" Ot Oa vu 
Hence, M™=w; M=S;/c. (A.10) 
In the following, we shall write 61,62 ...., to mean 627 674... 
respectively. 
Og es ak 
From (12.176): [52+ Blom] =VoM™s k=1,2,3, 
k 
Ocgr , O aap OM 9a ‘ay J Sum over j 
a = Oe + St"), 
= cot si Bat | em) ct " Dai | ) k=1,2,3. 


Hence, M°* = cg,; MI* = &3* , (A.11) 


Appendices 347 


We can now write all the 4 x 4 components of M"" (a). 


0 1 2 3 


0 W Cox Coy Cz 
2] S/o @21, 22 O23 | 
3 S./e 31, 632 933 


Because of Eq. (12.13), cg = *, and the tensor is symmetric. 


a? 


The trace of the Maxwell’s 4-tensor follows from (A.7) and (A.9). 


Tr{M} = M¥, = M® — (M+ M” + M®) 


= (Ot Oo Oo) 


= w+ (Tad + TH + Taz) = w— 2 (B+ 2B) = 0. (A.13) 


B.1. Useful Integrals 


We shall write derive the values of some integrals required in this book. The 
integrands of all the integrals will have in their denominators integer /half- 
integer powers of the expression (r?+a?—2ra cos @), the integration variable 
will be 6, and the range of integration [0,7]. We shall do some preliminary 
work by changing the variable of integration from # to 7, accompanied by 
the change of the limits of integration, and conversion of the numerators 
for the first two cases: 


n’ =r’ +a? — 2ra cos8, (B.1a) 
ndyn = ar sin@ dé, (B.1b) 
B® pg? 
a—reos§ = (B.1c) 
22) (92 1 2\ 2 
(r? +42) cos —2ra =P) FT (B.1d) 
ra 


Lower limit: 0=0>7={(a-—r), ifa>r}; {(r—a), if} r >a}. 
(B.1e) 


Upper limit: @=7>n=a+r. (B.1f) 
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Direct Evaluation 


Using the above conversions hints it should not be difficult for the reader 
to establish the following integrals: 


Integral #1 


a 
ee (et mia ge @ oO CGH 
(r? + a? — 2ra cos 6)? 0 ifa<r. 


(r2 + a?) cos — 2ra 


(r2 + a? — 2ra cos6)° 


Wo(r,a) = [ 


Evaluation using Maxima 


| sin 6 dO = 0. (B.3) 


We have evaluated the following three integrals, using Maxima (version 
5.13.0). We shall first write down the values of the integrals, and then 
show the commands used in Maxima to obtain these results. Let us write 


a=2cos6; 8 = 2(r? + a”) cos6 — (3 + cos? A)ar. 


B.4 
y=sind; 6 =(r?—2a? +arcosé) sind. ee 
Integral #3 
W3(r, a) = i [(a? — 7”) cos 6 — 2a7ysin 6] sin dé = 0. (B.5) 
0 
Integral #4 
™ | (a8 — 6) cosé — (ad + By)sind] . 
wv = —————— 6d 
a(r,a) [ | (r2 + a? — 2ra cos 6)5/2 aie 
12r 
0 (a <r) 


Integral #£5 


_ [7 | (6? — 67) cos9 — 285sinO| _ 
UWs(r, a) = | eee sin 6 dé = 0; (B.7) 
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Maxima Commands, Inputs and Outputs 


We shall write the interactive commands and prompts between the user 
and the Maxima so that the reader can verify the values of the integrals 
#4 and #5. Note the following: 


1. Some output lines (e.g. %05, %o6 in Ex. #4) are spread over two lines in 
which the first line contains the “indices”, e.g. “to the power 2”. These 
indices get displaced and detached from the base when the output is 
copied into any text file. To avoid this anomaly, we have brought them 
to one line using mathematical mode. 

2. If the output is an expression of a definite integral, it is spread over seven 
lines (e.g. as in %09 in Example #4), and the integral sign becomes 
unintelligible when copied. We have replaced these outputs and other 
outputs that appear too long and complicated with “...”. All outputs 
except the final one are non-essential. 


Input/Output for Integral #4 


(%il) aa: 2*cos(x); 
(%o1) 2 cos(x) 
(%i2) bb: 2 *( r~2 + a7 2) *cos(x) — ( 
(%o02)  2(r? + a”) cos(x) — ar(cos?(x) + 3) 
(%i3) cc: sin(x); 
(%o3) — sin(x) 
(%i4) dd: (r~ 2 —2* a~2 + a* r *cos(x) )*sin(x); 
(%o4)  (arcos(x) +r? — 2a?) sin(x) 
(%i5) — f: (aa*bb — cc*dd)* cos(x) — ( aa*dd+bb*cc)* sin(x); 
(%o05) cos(x) (2 cos(x) (2(r?2 + a7) cos(x) — ar(cos?(x) + 3)) 
— (ar cos(x) + r? — 2a”) sin?(a))— sin(x) 
((2(r? + a?) cos(x) — ar(cos?(x) + 3)) sin(x) 
+ 2 cos(x) (a r cos(x) + r? — 2a? ) sin(x)) 
%i6) et: abs(sqrt(r” 2+a~ 2 — 2*r*a*cos(x))); 
%o6) sqrt(— 2 ar cos(x) + r? +a? ) 
%i7) hi: (£/(et~ 5))*sin(x); 


%o08) [a> tr 


( 
( 
( 
(%i8) assume (a—r > 0); 
( 
(%i9) integrate (h, x); 

( 
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(%i10) 


(%o10) 
(%il1) 


(%o11) 
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changevar (%, et — y, y, x); 

Is y positive, negative, or zero? 

pos; 

solve: using arc-trig functions to get a solution. 
Some solutions will be lost. 


%,nouns; 
Is sqrt(r + 2ar +a) — sqrt(r — 2 ar + a) positive, negative, 
or zero? 


pos; 
Is r + a zero or nonzero? 
nonzero; 
sqrt(r? — 2ar + a?)(48a?r" + 36a3r® + 8atr® + 4a°r*) 
~( r—a 
sqrt(r? + 2ar + a”)(48a?r7 — 36a°r® + 8a4r> — 4a5r*) 
r+a 


+sqrt(r? + 2ar + a”)(—48a2r® + 12a%r> — 4atr*) 


—sqrt(r? — 2ar + a”)(—48a?r® — 12a3r> — da*r*t))/(16a®r°) 


To simplify the last output (%o11), set 


and get 


a—r) if, (a>r 
ee enh : - : if. : : i fae 
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po = PF for the first case and 0 for the second. 


Input/Output for Integral #5 


bb: 2*(r~ 2 + a” 2)*cos(x) — ( (cos(x))~ 2 +3)*a*r; 

2 (r?+ a?) cos(x) — ar (cos?(x) + 3) 

dd: (r~ 2 —2*a~2 + a*r*cos(x) )*sin(x); 

(a r cos(x) + r?— 2 a?) sin(x) 

f: (bb~ 2—dd~ 2)*cos(x) — 2*bb*dd*sin(x); 

cos(x) ((2 (r2+ a”) cos(x) — ar (cos?(x) + 3))? 

— (ar cos(x) + r?— 2 a)?sin?(x)) — 2 (a r cos(x) + r?— 2 a?) 
(2 (r?+ a”) cos(x) — ar (cos?(x) + 3)) sin?(x) 

ets: r~ 2+a~2 — 2*r*a*cos(x); 

— 2arcos(x) + r?+ a? 


(%i5) 
(%05) 


(%o9) 
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h: (f£/(ets~ 5))*sin(x); 

(sin(x) (cos(x) ((2 (r?+ a”) cos(x) — ar (cos?(x) + 3))? 

— (ar cos(x) + r?— 2 a”)?sin?(x)) — 2 (a r cos(x) + r?— 2 a?) 
(2 (r7+ a”) cos(x) — ar (cos?(x) + 3)) sin?(x)))/(— 2 ar cos(x) 
4 24 a2)? 

assume (a—r > 0); 

la >t] 

integrate (h, x, 0, %pi ); 


changevar (%, abs(sqrt(ets)) — y, y, x); 

Is y positive, negative, or zero? 

pos; 

solve: using arc-trig functions to get a solution. 
Some solutions will be lost. 


%,nouns; 

Is sqrt(r?+ 2 ar + a”) — sqrt(r?— 2 ar +a”) positive, negative, 
or zero? 

pos; 

Is r + a zero or nonzero? 

nonzero; 


0 
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Epilogue 


In the high reaches of the Himalayas, the ice of the Gangotri glacier melts, 
and as it descends down the mountains into the plains, is swelled by the trib- 
utaries and groundwaters to form a mighty river, the Ganga. But Ganga 
is not just a river. It is also a concept, one of the pillars of a Faith that 
has moulded a civilization. It could be an edifying experience, intellectually 
and physically, to journey down the course of the river, from the mountain 
to the sea, as it meanders past holy cities and historical monuments. 

Nestled in the lofty heights of scientific analysis and philosophical rumi- 
nations, occasionally great and epoch making theories are born. Some of 
them cause deluge of such a magnitude as to shake everything on their way, 
altering the course of history, uprooting the foundations of conventional 
notions and concepts, and building on the ruins of destruction another edi- 
fice of much greater vigour and beauty. It can be an edifying experience 
to make an intellectual voyage downstream of the deluge and thrill at the 
profound upheavals that one intellectual feat of a human mind could bring 
about. 

Sometimes the impact is noticeable to all sections of society, as for 
example in the case of Faraday’s discovery of Electromagnetic Induction, 
leading to widespread use of electricity. More often, however, the impact is 
of a more subtle and esoteric nature, comprehensible to the avowed practi- 
tioners of the discipline. Newton’s formulation of Classical Mechanics and 
Gravitation falls in the latter category. It replaced the decrepit Aristotelian 
beliefs with a system of analysis whose mind-boggling profundity, universal- 
ity, power and simplicity laid the foundations of physics. Much of our stud- 
ies in classical physics is a journey through the course of a river that finds 
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its source in the laws of motion and gravitation as conceived by Newton, 
and streams through the macroworld of planets and satellites, the earthly 
world of missiles and locomotives, down to the microworld of atoms and 
molecules, as if to unify the three worlds in a single grand design. Even 
though the advent of Quantum Mechanics curbed the role of Newton in 
the microworld, and General Theory of Relativity redefined cosmological 
concepts, they are not to be regarded as an abandonment of Newtonian 
ideas, but a refinement of the same. The unifying spirit of Newton has 
been the driving force behind the pursuits of physics among all succeeding 
generations. 

In this book, we tried to trace the course of another great river, the 
Theory of Relativity, which finds its origin in the belief that all frames of 
reference are equal. Seen in isolation, this principle is just an extrapolation 
of the egalitarian value system from society to reference frames, without any 
visible impact. However, in juxtaposition with the laws of electrodynamics, 
this innocuous hypothesis makes startling revelations about the relative 
nature of space and time, leading to paradoxes of time dilation and length 
contraction (Chapter 2). An immediate fallout is Lorentz transformation, 
providing uniform prescription for the conversion of the time and space 
coordinates of an event between frames of reference (Chapter 3). 

The same Lorentz transformation does not seem to fit in within the 
scheme of Newtonian Mechanics unless momentum is redefined and the 
energy expression is modified. But this does not seem possible without 
recognizing mass—energy equivalence. The innocuous hypothesis then opens 
up an entirely new world where matter is annihilated to liberate energy. 
A new branch of physics is born — the physics of Atomic Nuclei and the 
physics of Elementary Particles with awesome forebodings of a nuclear 
holocaust (Chapter 4). 

The mathematical language in which the laws of physics, in particu- 
lar, the conservation of Energy and Momentum are expressed involve an 
exposition to the world of tensors. Maxwell’s stress tensor gives a beautiful 
illustration of the stress that exists in empty space, in the vicinity of electric 
charges and currents, or anywhere else where electromagnetic field exists 
(Part Il). 

The vision of relativity is incomplete unless physical quantities are 
looked upon as four-dimensional geometrical objects having one time and 
three space components. Realization of this four-dimensional world, called 
space-time, begins with the construction of the Minkowski metric giving 
the expression for a line element stretching between two events. The vision 
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of this four-dimensional scheme in the workings of the universe is then con- 
summated through the Principle of Covariance that declares that the math- 
ematical expressions of the laws of physics must be an equation between 
two 4-tensors of the same rank and type. This leads to the unification of 
energy and momentum within a 4-vector, the 4-momentum (also called or 
En-Mentum in this book), and Minkowski’s equation of motion (EoM) in 
terms of another 4-vector, the 4-force (also called Pow-Force in this book). 
An illustration of this new scheme is provided by the relativistic rocket, and 
the covariant equations of classical electrodynamics (Part III). 

Further example of the Principle of Covariance is provided by energy 
tensor, which not only illustrates the succinctness, elegance and the power 
of the energy-momentum (i.e. 4-momentum) conservation equations but 
also builds a bridge to the General Theory of Relativity, which is also the 
Relativistic Theory of Gravitation (Part IV). 

Within the limited scope of this book we have thus taken the reader on 
a guided tour of the Relativity Valley. Originating from a modest creed, 
nurtured by the Laws of Classical Electrodynamics, swelled by the conserva- 
tion laws of energy-momentum, embellished by the demands of Covariance, 
the relativity theory turns into a mighty river providing nourishment to all 
branches of modern physics. Midway downstream of our journey we had 
to call it a day as the river was about to enter a deep canyon of exquisite 
beauty — the domain of the General Theory of Relativity. Some of the 
readers who might have received an inspiration from their visit to the tem- 
ple of energy tensor for a rafting expedition down these turbulent rapids 
may now feel dejected at this last moment betrayal of their captain. How- 
ever, great expedition requires not only great courage, but also necessary 
equipments and above all, arduous preparations and training to face unex- 
pected challenges in a land of adventure. It is now time to undergo this 
training from a master of tensor calculus. We shall resume our voyage, 
hopefully in future, after period of interlude to give some sobering time for 
the impatient expeditioner. 
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conservation of 4-momentum, 249, 
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Einstein Summation Convention, 214 

Einstein tensor, 16 
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energy-momentum conservation, 106 
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hyperbolic world line, 200 


identity matrix, 215 

identity tensor,idemfactor, 151 
inertia tensor, 155 

inertial force, 7 

inertial frames, 9 

instantaneous rest frame, 84 
invariant rest mass, 95 
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relativistic mass, 95, 96, 239, 272 
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relativity principle, 9 
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time dilation, 36, 202 

time-like, 60, 211 

total energy, 103 
transformation of EM field, 300 
twin paradox, 114 


ultra-relativistic limit, 97 
unit lab volume, 321 
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Van de Graff accelerator, 99 

variable mass, 244 

vector, 218 

velocity addition formulas, 81, 83 

velocity transformation, 81 
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